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ABSTRACT 

When the source in a four-image gravitational lens system lies sufficiently close to a “fold” caustic, two of the 
lensed images lie very close together. If the lens potential is smooth on the scale of the separation between the 
two close images, the difference between their fluxes should approximately vanish, Rfoid = (F+-F-)/{F++F-) « 
0. (The subscript indicates the image parity.) Violations of this “fold relation” in observed lenses are thought to 
indicate the presence of structure on scales smaller than the separation between the close images. We present 
a detailed study of the fold relation in realistic smooth lenses, finding it to be more subtle and rich than was 
previously realized. The degree to which Rfou can differ from zero for smooth lenses depends not only on 
the distance of the source from the caustic, but also on its location along the caustic, and then on the angular 
structure of the lens potential (ellipticity, multipole modes, and external shear). Since the source position is 
unobservable, it is impossible to say from Rfou alone whether the flux ratios in an observed lens are anomalous 
or not. Instead, we must consider the full distribution of Rfoid values that can be obtained from smooth lens 
potentials that reproduce the separation di between the two close images and the distance d 2 to the next nearest 
image. (By reducing the image configuration to these two numbers, we limit our model dependence and obtain 
a generic analysis.) We show that the generic features of this distribution can be understood, which means that 
the fold relation provides a robust probe of small-scale structure in lens galaxies. We then compute the full dis¬ 
tribution using Monte Carlo simulations of realistic smooth lenses. Comparing these predictions with the data, 
we find that five of the the 12 known lenses with fold configurations have flux ratio anomalies; B0712H-472, 
SDSS 0924H-0219, PG 1115 h-080, B1555h- 375, and B1933 h- 503. Combining this with our previous analysis 
revealing anomalies in three of the four known lenses with cusp configurations, we conclude that at least half 
(8/16) of all four-image lenses that admit generic, local analyses exhibit flux ratio anomalies. The fold and 
cusp relations do not reveal the nature of the implied small-scale structure, but do provide the formal foundation 
for substructure studies, and also indicate which lenses deserve further study. Although our focus is on close 
pairs of images, we show that the fold relation can be used — with great care — to analyze all image pairs in 
all 22 known four-image lenses and reveal lenses with some sort of interesting structure. 

Subject headings: cosmology: theory — dark matter — galaxies: formation — gravitational lensing — large- 
scale structure of universe 


1. INTRODUCTION 

Once baffling, the flux ratios between the images in four- 
image gravitational lens systems have recently become a 
source of considerable excitement. During the 1990s, stan¬ 
dard smooth lens models (using ellipsoidal lens galaxies, 
plus tidal shear from lens environments) successfully han¬ 
dled ever-improving data on the number and relative posi¬ 
tions of lensed images, but consistently failed to fit the im¬ 
age fluxes. The first step tow ard solving this problem came 
when iMao & Schneideii ( lT998l) realized that small-scale struc¬ 
ture in lens galaxies, which had previously been neglected, 
could easily explain the “ano malous” flux ratios. The ex- 
citement beg an in earnest when iMetcalf & MadaiJ il2flfl1ll and 
IChibal j200^ pointed out that the Cold Dark Matter (CDM) 
paradigm might naturally explain the sort of substructure re- 
quired to fit the fluxes. Soon after. lDalal & Kochaneklil2flfl2h 
introduced a method of analyzing lens data to measure the 
properties of substructure. They concluded that 2.0^j 4 per¬ 
cent (at 90% confidence) of the mass in lens galaxies is con¬ 
tained in substructure, which seemed to agree with CDM pre- 
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dictions, and to reveal that the so-call ed “missing” satellites 
(iMoore et al.lll999llKlvDin et alJI 19991) are in fact present but 
dark. Anomalous flux ratios had become a powerful test of 
CDM on small scales, and potentially a unique probe of the 
fundamental nature of dark matter. 

Before carrying the conclusions too far, though, we must 
recall that there are many links in the chain of logic from ob¬ 
servations of flux ratio anomalies to tests of CDM that need 
to be filled in. First, we must identify lenses with anomalous 
flux ratios. Second, we should list all the different types of 
small-scale structure^ that might create flux ratio anomalies, 
and understand what observations or analyses could distin¬ 
guish between them. Third, we must see if present data do 
distinguish different types of small-scale structure. If so, we 
can then quantify the amount of small-scale structure present 
in real lens galaxies. By comparing the inferred nature and 
abundance of small-scale structure to theoretical predictions, 
we can test the CDM paradigm. Finally, if we can understand 
how the predictions depend on the assumption that dark mat¬ 
ter is cold and collisionless, we may be able to use lensing to 
p robe the fundamental prope rties of the dark matter particle. 

iDalal & Kochanetd (l2flfl2h were the first to construct a re¬ 
alization of the full chain of logic. Briefly, they identified 

^ The term “substructure” seems to have come to represent the sort of 
small-scale structure predicted by CDM. The term “small-scale structure” en¬ 
compasses more general sorts of structure such as multipole modes, isophote 
twists, tidal streams, etc., so it is our term of choice. 
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anomalous flux ratios as those that could not be fit with stan¬ 
dard lens models. They focused on radio flux ratios in or¬ 
der to ignore microstructure associated with individual stars 
in lens galaxies, and assumed that the only important small- 
scale structure is dark matter clumps of the sort predicted by 
CDM. (They argued that other sorts of small-scale structure, 
such as globular clusters and dwarf galaxies, are much less 
abundant than the inferred number of CDM clumps.) They 
assumed that the amount of CDM substructure is a universal 
fraction of the total density, used the lens observations to place 
constraints on that fraction, and then compared their results 
with predictions from CDM simulations. Making the various 
assumptions was necessary to build the first connection be¬ 
tween lens flux ratios and the nature of dark matter. However, 
questions have been raised about some of them, which prompt 
us to go back and reassess each link in the chain. This eval¬ 
uation is essential if we want to claim lensing as a reliable 
probe of small-scale structure in the universe. Moreover, it 
is intrinsically interesting because it will lead us to a deeper 
understanding of diverse topics in both lensing and structure 
formation theory. 

Let us first consider the CDM end of the chain. There has 
been a surge of interest in refining predictions about sub¬ 
structure. It now appears that the substructure mass frac¬ 
tion need not be universal, but may vary bot h within a 


giyen halo and from one halo to another (e.g 

.. IChen et al. 

2003 

JZentner & Bullockl2003UMao et alJ2004t 

Oguri & Lee 

2004 

). Tidal forces might be able to destroy dark matter 


clumps at the small radii where lensed images typically ap¬ 
pear, in which case CDM might predict to o little substruc¬ 
ture to explain obse rved flux ratio anomalies JMao et al.l2004 
lAmara et^l2004) . If so, we should consider whether small 
halos projected along the line of sight can provi de sufficient 
small- scale structure. The situation is unclear, as IChen et alJ 
( 120031) claim that the millile nsing optical dept h from the line 
of sight is fairly small, while lMetcalflil200.5alhh claims that in¬ 
terloping structures are sufficient to explain flux ratio anoma¬ 
lies. Another possibility is that revised analyses of lens data 
may lower the required amount of small-scale structure (see 
below). A third possibility, of course, is that lensing and CDM 
simply disagree about small-scale structure. In any case, the 
important point is that the CDM predictions are challeng¬ 
ing and still somewhat uncertain, and more work needs to be 
done. Mastering the theory involves both technical issues (nu¬ 
merical resolution) and physical effects (dynamical friction, 
tidal disruption), so it is not only essential for interpreting the 
lensing results, but also interesting in its own right. 

Now moving to the lensing side, we must first ask whether 
flux ratio anomalies are real. lEvans & Witll (120031) recently 
suggested that at least some of the “anomalies” might just 
be artifacts of certain assumptions in standard lens models. 
Specifically, instead of assuming the usual elliptical symme¬ 
try, they allowed perturbations from m = 3 and m = 4 multipole 
modes. Such modes are not only observed in the luminosity 
distributions of real ga laxies dBender et alJl98^ISaglia et alJ 
ll993URest et alJ2001l) . but also predicted in the mass distribu¬ 
tions of simulated galaxies dHevl et al.ll994trNaab & Burkerll 
120031) . s o it does not see m unreasonable to allow them in lens 
models. lEvans & Wittl found that they could fit two of the 
three “anomalo us” lenses they considered, without substruc¬ 
ture. However, ICongdon & KeetoT] d2005h found that mul- 
tipole models fail t o expl ain the strongest anomalies. Also, 
iKochanek & Dalall d2004) argued that even low-order multi¬ 
pole modes cannot explain an important statistical property 


of the ensemble of flux ratio anomalies: an asymmetry be¬ 
tween images that form at minima of the time delay surface 
and those that form at saddle points, such that anomalous min¬ 
ima are almost always brighter than expected while anoma¬ 
lous saddles are usually fainter than expected. To its credit, 
the CDM substructure hypothesis, and stellar microlensing , 
can both explain such an asymme try (lMetcalf_&Madau|2()0l|; 
Sche chter & Wambsgan^r2002t iKeetonI 120031: iBradaC et al.l 
20041) . However, it is not yet known whether alternative hy¬ 
potheses co uld explain the asymmetry as w ell. In a different 
response to lEvans & Witd IYoo et al.l d2005l) recently showed 
that in PG 1115-)-080 the Einstein ring image of the quasar 
host galaxy rules out the sorts of multipole modes that would 
be needed to fit the quasar flux ratios. This type of analysis 
is very promising, but it demands deep, high-resolution, near- 
infrared observations combined with a sophisticated modeling 
analysis, and it must be applied on a case-by-case basis. 

Clearly, a top priority must be to develop methods to de¬ 
termine whether flux ratio anomalies are real and indicate 
small-scale structure. One approach is to look for new data 
that cleanly reveal small-scale structure. The most unam¬ 
biguous situation is the detection of flux perturbations asso¬ 
ciated with microlensing by stars in the lens galaxy. De¬ 
tecting time variability in optical fluxes can prove that mi¬ 
crolen sing occurs (e.g.. lWozniak et ^l200fll: ISchechter et alJ 
l2003l) . Barring that, the next best thing is to take optical 
spectra of lensed images and use similarities or differences 
between emission line and continuum flux ratios to distin¬ 
guish between microlensing, millilensing (a term sometimes 
applied to flux perturbations caused b y CDM-tvpe substruc¬ 
ture), and errors in the macromodel JMoustakas & Metcald 
l2003l) . The required obseryations are challenging, but the 


method does 

appear to be successful (IWisotzki et a 

.1120031: 

Metcalf et al. 

120041 iMorgan et alJl2004i IWayth et al. 

\2OO5i 

Keeton et all 

20051). At this point, it is appropriate to note 


that the “more data” program has made it possible to con¬ 
clude that, whateyer their lensing interpretation may be, flux 
ratio anomalies are not electromagnetic phenomena. Mea¬ 
surements of flux ratios at different epochs and wayelengths 
haye shown that differential extinction and scattering cannot 
explain the unusual obseryed flu x ratios fseelKo onmans et alJ 
I2003at l&^anek & Dalalll2()()l IChiba et al.ll2005l and Ap- 
pendix B). 

An alternate approach is to reanalyze existing data. Tra¬ 
ditionally, flux ratio anomalies haye been identified as 
those tha t cannot be fit with cer t ain smooth lens mod¬ 
els fe.g.. iDalal & Kochan^ UMj iMetcalf & Zhjlll^n?)! 
IKochanek & Dalaifl200^ That analysis is, of course, sus¬ 
ceptible to the criticism of being model-dependent. Per¬ 
haps eyen more important, it may be sensitiye to certain 
global symmetries in the popular lens models that lead to 
globa l relations among the magnifications of the four im¬ 
ages dPal all 1 9981: 1 W & MaolDOnnl Ifhinter & Evansll2(i()Tt 
lEyans & Huntel^2002^ . Eailure to fit obseryed flux ratios may 

simply indicate failure of the global symmetries — which 
is yery different from saying that there must be small-scale 
structure. To circumyent both of these problems, we would 
like to deyelop an analysis that is both local in the sense that 
it only depends on properties of the lens potential around and 
between closely-spaced images, and generic in the sense that 
it does not depend on any specific properties of the types of 
models that are used to analyze the data. 

Eortunately, lens theory has uncoyered precisely what we 
need: local and generic relations between the magnifications 
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between certain images in certain configurations. Specifically, 
two images in a “fold pair” (defined in 0 should have mag¬ 
nifications and that satisfy the approximation relation 
Ima| “ |mb| ~ 0 ; while three images in a “cusp triplet” should 
have I Pa I “ I Mb I +1 Me I ~ 0- This cusp relation p layed a central 
role in the analysis bv iMao & Schneid^il 19981) that led to the 
idea that lens flux ratios may probe small-scale structure. If 
we want to use the fold and cusp relations today, however, we 
must rigorously understand how “local” and “generic” they 
really are, and whether they can actually be used as the ba¬ 
sis of a realistic but robust method for identifying flux ratio 
anomalies that indicate small-scale structure. 


The standard fold and cusp relations are derived from 
low-order Taylor series expansions in the vicinity of a fold 


or cusp caustic dBlandford & NaravanI 11986 

Im^ fT9^ 

Schneider & Weisgll992HSchneider et aljll992 

; iPetters et al J 

200lHGaudi & Pettersl2002albl). Formallv, thev 

are only valid 


when the source lies asymptotically close to a caustic. We 
have undertaken to reexamine the relations in more realistic 
settings, when the source sits a small but finite distance from 
a caustic, and the lens potential has a variety of nontrivial 
but smooth structures such as different radial profiles, ellip- 
ticities, octopole (m = 4) modes, and external tidal shears. In 
iKeeton et alJd2003l hereafter Paper I), we studied the cusp re¬ 
lation. We found it to be insensitive to the radial profile of the 
lens galaxy, but quite sensitive to ellipticity, multipole modes, 
and shear. We quantified the degree to which these features 
can cause jp^l “ |mb| + ImcI to deviate from zero even for 
smooth lenses. We then compared those allowed deviations 
with observed data and found that five observed lenses vio¬ 
late the realistic cusp relation in ways that indicate the pres¬ 
ence of small-scale structure (B0712H-472, RX J0911H-0551, 
SDSS 0924H-0219, RX J1131-1231, and B2045H-265). We 
were very careful to state the limitations of the analysis, in 
particular to remark that study of the cusp relation — like all 
other analyses of single-epoch, single-band flux ratios — can¬ 
not reveal the nature of the implied small-scale structure. The 
strongest conclusion that can be drawn from a generic anal¬ 
ysis is that the lens must have significant structure on scales 
smaller than the separation between the triplet of cusp images. 
We believe that this sort of deep discussion of the general fea¬ 
tures and applicability of a generic magnification relation is as 
valuable as the specific identification of flux ratio anomalies 
that it allows. 

In this paper, we turn our attention to the fold relation. We 
again seek to understand the general properties of the relation 
in realistic situations, and to use that understanding to iden¬ 
tify violations of the fold relation. We adopt the same basic 
approach as in Paper I: we first examine simple lens potentials 
analytically, then develop a Monte Carlo approach to study 
the fold relation in a realistic lens population, and finally use 
the realistic fold relation to look for flux ratio anomalies in ob¬ 
served lenses. However, many fine points of the discussion are 
rather different, because there are subtle but important ways 
in which the fold and cusp relations behave differently. In ad¬ 
dition, we have come to understand that the fold relation may 
be used — with great care — to learn something interesting 
about image pairs that are not obviously fold pairs. The dis¬ 
cussion of observed lenses therefore has a somewhat larger 
scope in this paper than it did in Paper I. One final difference 
is that the sample of published four-image lenses has grown 
by three since Paper I. 

As in Paper I, we assert that, even though we adopt specific 
families of lens potentials, our analysis is more general than 


explicit modeling. One reason is that we take pains to under¬ 
stand what is generic in the fold relation. A second reason 
is that we have a better distinction between global and local 
properties of the lens potential. For example, a global m=l 
mode (i.e., a lack of reflection symmetry) would affect con¬ 
clusions about anomalies in direct modeling, but not in our 
analysis. A third point is that our results are less modeling de¬ 
pendent, less subject to the intricacies of fitting data and using 
optimization routines. A fourth advantage of our analysis is 
that, rather than simply showing the standard models fail to fit 
a lens, it clearly diagnoses why. We believe that these benefits 
go a long way toward establishing that small-scale structure in 
lens galaxies is real and can be understood. 

We must address a question that is semantic but important; 
Where do we draw the line between a normal “smooth” lens 
potential and “small-scale structure”? As in Paper I, we take a 
pragmatic approach and consider “smooth” lenses to include 
anything that is known to be common in galaxies, especially 
early-type galaxies: certain radial density profiles, reason¬ 
able ellipticities, small octopole modes representing disky or 
boxy isophotes, and moderate tidal shears from lens environ¬ 
ments. Finding evidence for any or all of those in any given 
lens would not cause much stir. We then consider “small- 
scale structure” to be anything whose presence in lens galax¬ 
ies would be notable and worthy of further study. In other 
words, we do not attempt to distinguish between microlens- 
ing, CDM-type substructure, massive and offset disk compo¬ 
nents, large-amplitude or intermediate-scale multipole modes, 
etc. as explanations for flux ratio anomalies. That is properly 
the subject of a separate analysis, which can begin only once 
flux ratio anomalies have been rigorously identified. 

The organization of the paper is as follows. In ^we in¬ 
troduce a way to quantify four-image lens configurations that 
is convenient for the fold relation. In we review the ideal 
magnification relations for folds and cusps. In 0we use a 
simple lens potential to examine the general properties of the 
fold relation in different regimes. In ^we introduce a Monte 
Carlo technique for characterizing the fold relation for a re¬ 
alistic population of lens potentials. In ^we use our under¬ 
standing of the fold relation to evaluate all of the observed 
four-image lenses. We offer our conclusions and discussion 
in Two appendices provide supporting technical mate¬ 
rial. In Appendix A we present an important extension of the 
usual Taylor series analysis leading to the fold relation. In 
Appendix B we summarize the data that we analyze for all of 
the published four-image lenses. 


2. CHARACTERIZING FOUR-IMAGE LENSES 

At least 23 quadruply imaged lens systems are known. 
This count includes the 10-image system B1933H-503, which 
is complex because there are two different sources that 
are quadrup l y ima ged and a third that is doubly imaged 
ilSvkes et al'l 119^ . It excludes PMN J0134-0931 and 
B1359H-154 because each system has multiple l ens galaxies 
that lead to image multiplicit i es larger than fou r llRusin et alJ 
l200U IlCeeton & Winnllioo'^ IWinn et al.ll2003ll . The count 
also excludes systems like Q0957H-561 in which some faint 
secondary features, including the host galaxy of the source 
quasar, may be quadruply imaged but a re difficult to study 
llBernstein et alJl 9971: iKee ton et alJ2flfl0ll . Published data for 
the quadruply imaged systems are reviewed in Appendix B. 

The image configurations of quad lenses can usually be 
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Fig. 1.— Three fiducial configurations of four-image lenses: fold (top), 
cusp (middle), and cross (bottom). In each panel, the figure on the left shows 
the caustics and source position in the source plane, while the figure on the 
right shows the critical curves and image positions in the image plane. De¬ 
spite appearances, the fold and cusp sources sit a finite distance from the 
caustic. The configurations are distinguished by the distances d\ and d 2 , as 
indicated. 


classified “by eye” into three categories: folds, cusps,® and 
crosses. The names are related to the location of the source 
with respect to the lensing caustics. For our purposes it is 
more important to find a simple but quantitative way to char¬ 
acterize the configurations. When studying cusp lenses in Pa¬ 
per I, we used the separation and opening angle of a triplet 
of images. To study fold configurations, we are interested 
in pairs of images, in particular pairs comprising one image 
at a minimum of the time delay surface and one at a sad¬ 
dle point. (The parities of the images can usually be deter¬ 
mined unambiguously; see ISaha & Williamil2003t ') Let us 
label the two minima Mi and M 2 , and the two saddles Si and 
82 - (For definiteness, suppose Mi is the brighter minimum and 
M 2 the fainter, and likewise for the saddles.) When consider¬ 
ing the pairMi^i, for example, we define di = D{Mi,Si) and 
d 2 = mm[D{Mi,S 2 ),D{M 2 ,Si)\, where D(i,j) is the distance 
between images i and j. In other words, di is the separation 
between the images for the pair in question, and d 2 is the dis¬ 
tance to the next nearest image. Note that di and d 2 describe 
a pair of images. At times it is convenient to characterize the 
full configuration of all four images, and we define d* and d^ 
to be the values of di and d 2 for the pair with the smallest 
separation. In other words, a given four-image lens is fully 
characterized by the four values of (t/i, d 2 ) for the four differ¬ 
ent minimum/saddle pairs; but it is sometimes convenient to 
use {dl,dip as an abbreviation that encodes the overall mor¬ 
phology of the lens. 

Figure 1 illustrates the three fiducial configurations, and in¬ 
dicates di and d 2 for sample image pairs. In a fold lens, the 
source sits near a fold caustic so two of the images lie close 
together with di ^ d 2 - Furthermore, d 2 is comparable to the 
other scale in the problem, the Einstein radius /?ein- In a cusp 
lens, the source is near a cusp caustic so three of the images 

® Some authors subdivide cusps depending on w hether they are associated 
with the long or short axis of the lens potential te.g. . ISaha & Williamsl2()()3) . 
and we will follow suit when convenient. 


are close together and we have di ~ t/a “C /?ein- If the source 
does not lie near a caustic, then the images form a relatively 
symmetric cross configuration with di ^ d 2 ^ Rem- 
Incidentally, the three “archetypal” image configurations 
shown in Figure 1 were created using a singular isothermal el¬ 
lipsoid lens with axis ratio q = 0.5 or ellipticity e= l-q = 0.5. 
We chose source positions such that = 

0.46/?ein. which is similar to the separation between close im¬ 
age pairs and triplets in observed fold and cusp lenses. The 
values = df°^^ = 1.54/?ein were set by the choice of el¬ 
lipticity. We then chose the fold source position such that 
d^ = = d 2 °^^- Having different distances be equal to 

each other means that we can smoothly morph from the fold 
to the cusp by fixing di and varying d 2 , or from the fold to the 
cross by fixing d 2 and varying di. 

3. ASYMPTOTIC MAGNIFICATION RELATIONS FOR FOLDS AND 

CUSPS 


In this section we briefly review the expected relations 
between the magnifications of images corresponding to a 
source near a fold or cusp caustic. The relatio n s have been 


discussed before (iBlandford & NaravanI 

19861 IMaolll992I 

Schneider & Weisslll992l iSchneider et al.l 

19921 iPetters et al.1 

200lUGaudi & Petters|2002a|b|). but we have extended the re- 


lations to a higher order of approximation. 

As Paper I discussed in depth, when the source lies near a 
cusp caustic, the three associated images should have^ 


n _ lMA|-|fIfi| + |Mc| _ F/i-Fb + Fc ^ ^ 

Im^I + ImbI + ImcI Fa+Fb + Fc^ ^ ^ ’ 

where /i,- is the signed magnification of image i, while F, = 
Fsrc|/ii | is the flux of the image if the source has flux Fs^. 
(f^cusp is defined such that it is independent of Fsrc.) In our 
naming convention, B is the middle of the three images and 
there is no need to specify whether it is a minimum or saddle 
image. To state eq. o more precisely, we expand the lens 
mapping in a Taylor series about the cusp and find R^usp = 
0 -l-Acusp<i^ + ■ ■ ■, where d is the maximum separation between 
the three images, while Acusp depends on properties of the lens 
potential at the cusp point (physically, what matters is the el¬ 
lipticity, higher-order multipoles, and external shear; see Pa¬ 
per I). Since the constant and linear terms vanish, a source 
lying sufficiently close to the cusp produces three close im¬ 
ages with d ^ 0 and hence R^usp 0. As the source moves 
a small but finite distance from the cusp, the cusp relation 
picks up a correction term at second order in d. Nevertheless, 
for realistic distributions of ellipticity, multipole amplitudes, 
and shear, it is possible to derive reliable upper bounds on 
Fcusp. Roughly speaking, we may say that those bounds can 
be violated only if the lens potential has significant structure 
on scales smaller than the distance between images, although 
Paper I provides a much more careful discussion. 

Appendix A of this paper shows that when the source lies 
near a fold caustic, the two images near the fold critical point 
should have 


Rfold 


Ifimin I iMsadI 

iMmin I iMsadl 


^sad 

^sad 


0 . 


( 2 ) 


We are interested in pairs consisting of a minimum and a sad¬ 
dle, and we define T^foid such that the saddle image gets the 


^ In Paper I we used the absolute value of Rcusp^ btit it has become 
clear that the sign is an important component of theoretical predictions 

('e.p..|Metcalf & Madai^2Q^^ Schechter & Wambsganssi20Q2tlKeetoiJ20Q3t 

IKochanel^^^alS ll2nn^lRradac e.t alJI2()nir SO we retain it now. Working 
with the signed quantity would not change the conclusions of Paper I. 
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minus sign in the numerator. Again, to be more precise we 
use a Taylor series expansion of the lens mapping near the 
fold point to find f?foid = 0 +A'(Am)'/^ + ..where Am is the 
perpendicular distance of the source from the caustic, and A' 
is a constant t hat dep ends on local properties of the lens po¬ 
tential (see ea. lA19l in Appendix A). Working instead in the 
image plane, we can write /?foid = O-l-Afoidt/i w here di is 
the distance between the two images (see eq. IA20> . The ideal 
fold relation /?foid ^ 0 holds only when the source is asymp¬ 
totically close to the caustic. Now there is a correction term at 
hrst order in lii, whose coe fficien t Afoid depends on properties 
of the lens potential (see en. lA21 > . In other words, the correc¬ 
tion to the fold relation is of lower order than the correction to 
the cusp relation, which means that the fold relation is more 
sensitive to a small offset from the caustic. Thus, some care 
will be needed to determine whether an observed violation of 
the ideal fold relation really reveals small-scale structure, or 
just indicates that the source lies a finite distance from a fold 
caustic. 


4. UNDERSTANDING THE EOLD RELATION 
4.1. In the asymptotic regime 

We can begin to understand general features of the fold re¬ 
lation by examining the coefficient in the asymptotic limit for 

Rfo\d, 


A fold = 




3V'112V'222 + V’2222(1 -lAll) 
6V>222(1 -i/’ll) 


(3) 


where the tp’s. represent various derivative s of the le ns po¬ 
tential, evaluated at the fold point (see eas. lA2ll4A23l in Ap¬ 
pendix A). Imagine moving along the caustic and evaluating 
Afoid at various points. As we approach a c usp, ^ 0 while 
the other derivatives remain finite (e.g.. iPetters et alJl2001[ 
p. 346), so |Afoid| ^ oo. The sign depends on the type of cusp. 
A “positive” cusp has two minimum images and one saddle, 
and typically occurs on the long axis of the lens potential; it 
has Afoid —*■ “OO with a minus sign because the saddle image is 
brighter than each minimum. A “negative” cusp has two sad¬ 
dles and one minimum, and typically occurs on the short axis 
of the potential; it has Afoid ^ +oo with a plus sign because 
the minimum image is brighter than each saddle. One im¬ 
plication of |Afoid| ^ oo is that the fold relation breaks down 
near a cusp, but that is not surprising because the asymptotic 
analysis in Appendix A explicitly assumes that we have cho¬ 
sen a fold point and are examining a small neighborhood that 
does not include a cusp point. Besides, near a cusp it is the 
cusp relation that ought to be satisfied, not the fold relation. 

The more interesting implication is that Afoid can take on all 
real values, both positive and negative. Unless there is some 
remarkable discontinuity, there must be a region where Afoid 
changes sign. Figure 2 confirms that this is the case for a typ¬ 
ical example, namely an isothermal ellipsoid lens with axis 
ratio q = 0.5 or ellipticity e = \ -q = 0.5. There is a region 
where |Afoid | is small or even zero, so that the ideal fold rela¬ 
tion /?foid ^ 0 is quite a good approximation. In this region 
the distance d 2 is large,* but interestingly the smallest values 
of |Afoid| do not correspond to the largest values of d 2 - Over 
the larger range where d 2 is large enough that the image con¬ 
figuration would be classified as a fold (roughly d 2 ^ 1), we 
find |Afoid| ~ 0.1-0.3. The important implication is that a lot 


° The distance di can be arbitrarily small depending on how close the 
source is placed to the caustic, but d 2 remains finite even when the source lies 
right on the caustic. 



Fig. 2.— Caustic curve for an isothermal ellipsoid lens with an axis ratio 
q = 0.5, or ellipticity e = l-q = 0.5. The numbers above the points indicate 
values of the coefficient Afou in the asymptoti c fold relation Rfoia = Afou di + 
... at various points along the caustic. (See eq.|A21Jin Appendix A; recall that 
Afoid is to be evaluated on the caustic, but it then describes the fold relation 
in the vicinity of the caustic.) The numbers below the points indicate the 
corresponding values of d^ (in units with Rein = 1). The other quadrants can 
be filled in by symmefry. 


of lenses that are clearly folds may nevertheless fail to satisfy 
the ideal fold relation f?foid ^ 0. 

An even more important conclusion is that the validity of 
the ideal fold relation depends not just on whether the source 
is close to a fold caustic, but where the source is located along 
the caustic. This point is shown more directly in the next sub¬ 
section. 


4.2. Across the source plane 

To mo ve beyo n d the asymptotic regime, we use the soft¬ 
ware by iKeetonI (1200 Ih to solve the lens equation exactly 
throughout source plane for an isothermal ellipsoid lens with 
ellipticity e = 0.5. For each source inside the astroid caustic, 
we find the four images, identify the pair with the smallest 
separation, and then compute d\, d 2 , and f?foid for that pair. 
(These are by definition the same as d^ and ^f|.) The results 
are shown in Figure 3. First, it is valuable to understand how 
di and d 2 vary with source position, as shown in panels (a) 
and (b). The separation di between the images measures very 
directly the distance of the source from the caustic. The dis¬ 
tance d 2 to the next nearest image varies along the caustic, and 
basically measures the distance of the source from a cusp. In 
general, fixing both di and d 2 fixes the source to one of eight 
positions (two in each quadrant). 

Panels (c) and (d) show /?foid as a function of source posi¬ 
tion, with contours of di and d 2 overlaid. /?foid is large and 
negative near the long-axis cusp, and it is large and positive 
in a band extending from one short-axis cusp to the other and 
passing through the origin. The area in which /?foid > 0 is 
larger than the area in which /?foid < 0, which means that the 
distribution of f?foid values is not symmetric about /?foid = 0. 
Near the origin, /?foid is large and positive. Both of these 
points will be important for our analysis of real lenses in 
^ There is a “wedge” of small f?foid values starting at the 
caustic but extending well inside; this corresponds to the re¬ 
gion where the asymptotic coefficient Afoid is nearly zero (see 
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Fig. 3.— (a) The grayscale and contours both show the distance di as a function of source position, for an isothermal ellipsoid lens with ellipticity e = 0.5. The 
contours range from 0 to 1.5 in steps of 0.25, in units with = 1- (^) The distance d 2 as a function of source position; the contours are again spaced by 0.25. 
(c) The colors show RMd as a function of source position; the color coding is shown along the top edge of panel e. The di contours from panel a are overlaid, (d) 
The colors again show Rfo\d, with the do contours from panel b now overlaid, (e-f) The colors and contours both show i?foid as a function of d{ on the abscissa 
and d 2 on the ordinate; the contours range from —0.4 to 0.4 in steps of 0.1. Panel e shows the case i?foid > 0^ while panel f shows the case /?foid < 0- Recall that 
folds have di <^d 2 ^ein. cusps have d\ d 2 Rein, and crosses have di d 2 Rein- The region outside the triangles is inaccessible for this lens potential. 
Note that the figures were generated with Monte Carlo simulations, and the sampling is imperfect at ~ Rein- 


Fig. 2). Interestingly, in the region near the caustic and mid¬ 
way between the cusps, where c /2 is large and where we would 
expect to find archetypal folds, /Jfoid is not terribly small. The 
source must get very close to the caustic before //foy vanishes. 
A remarkable visual impression is that T/foid seems to be more 
correlated with the c /2 contours than with the c/i contours. 


Finally, by tabulating the results for all the different source 
positions we can plot //fold in the (c/i,c/ 2 ) plane, as shown in 
panels (e) and (f).® There are two plots because there are two 
source positions in each quadrant, and hence two values of 

® We generated the figures with Monte Carlo sampling of the source plane, 
which yields imperfect sampling of the {di,d 2 ) plane in the lower left corner. 
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T^foid, with the same values of d\ and d 2 - These figures show 
more clearly that Rfou vanishes as di 0, but the speed with 
which that occurs depends on the value of d 2 . Furthermore, 
in the upper left corner (the region of fold configurations), the 
7?foid contours bend over and become quite sensitive to d 2 . 

We are forced to conclude that the fold relation depends not 
only on proximity to a fold caustic, but also on location along 
the caustic. Although we are not shocked — we knew that 
the fold relation should break down near a cusp — we are 
nevertheless surprised to discover how sensitive /Jfoid is to lo¬ 
cation along the fold caustic even when the source is far from 
a cusp. This point is profound, because the location of the 
source along the caustic is not observable, and cannot really 
be determined from the properties of the two images in the 
fold pair; it can only be inferred by considering the properties 
of the other two images as well. In particular, the distance 
d 2 to the next nearest image gives some indication of the lo¬ 
cation of the source along the caustic, and therefore plays a 
strong role in the fold relation. 

We begin to suspect that using the fold relation in practice 
is not a simple matter of finding a close pair of images and 
asking how much they deviate from /?foid ~ 0; the fold relation 
is in truth more subtle and rich. 

4.3. For all four image pairs 

So far, among the four images in a given configuration we 
have only examined the pair with the smallest separation, be¬ 
cause the fold relation best describes close pairs. The formal¬ 
ism can be applied to any pair, however, and to round out our 
general understanding of the fold relation it is instructive to 
examine all the pairs. 

Figure 4 shows Rfon for all minimum/saddle image pairs, 
as a function of the distances d* and d^ that characterize the 
image configuration. In the top panel we fix d^, so varying d^ 
morphs the configurations from folds to crosses. The largest 
value of d* corresponds to a symmetric cross, in which case 
the two minima are identical and the two saddles are iden¬ 
tical, so all four minimum/saddle pairs have the same value 
of /?foid- In the limit d^ ^ 0 we obtain ideal folds, and the 
fold pair converges to Rfoid 0 (the ideal fold relation). In 
this limit two other pairs converge to /?foid ^ ±1, which is 
easily understood: the two fold images (A and B in Fig. 1) 
have much higher magnifications than the two other images, 
so the pairs AD and CB will both have /Jfoid ^ ±1. There is 
no intuitively obvious asymptotic limit for the pair of non-fold 
images (CD). The figure suggests that such limits do exist, but 
we suspect that they depend on properties of the lens potential 
in ways that the limits TJfoid ^ 0 and ±1 do not. 

In the bottom panel we fix df so varying morphs the 
configurations between folds and cusps. Here the fold limit 
does not quite reach Rfon 0 and ± 1 because we have fixed 
di to a finite value that does not actually correspond to an 
ideal fold. The more interesting limits are in the direction of 
cusps. As £/| ^ d^ we obtain a symmetric cusp configuration. 
For a symmetric ideal cusp, we can predict /?foid ^ ±1/3 and 
±1 based on the following logic. By symmetry. Fa = Fc so 
the ideal cusp relation implies Fb « 2 Fa, and the fold relation 
then yields /?foid ~ ±1/3. The sign is -I- for a long-axis cusp 
(in which case B is a saddle), or - for a short-axis cusp (B is a 
minimum). At the same time, in an ideal cusp the images A, 
B, and C are all much brighter than the fourth image D, so any 
pair involving D has fJfoid ~ ±1. As the source moves around 
the numerical values will change, but we generically expect 
two distinct values of Rfoid, one positive and one negative, for 


Fold Cross 






0 0.5 1 1.5 2 

d’ 


Fig. 4.— (Top) infold as a function of the distance d*, for fixed = 1.54. 
Moving from left to right smoothly changes the image configuration from 
fold to cross. There are multiple curves because there are four image pairs 
for each image configuration, and there may be two different configurations 
with the same For each configuration, the smallest-separation pair 

is marked in red. (Bottom) R^d as a function of the distance d^, for fixed 
d* = 0.46. Moving from left to right smoothly changes the image configura¬ 
tion from cusp to fold. The horizontal lines show various asymptotic limits: 
^foid 0 for an ideal fold pair; i?foid —^ dil/3 for the two pairs of an ideal 
cusp triplet; and /?foid —^ il for two other pairs in an ideal fold or cusp 
lens. The vertical lines indicate the separations for our archetypal lenses: 

^*fold ^ ^«usp ^ ^*cusp ^ Q ^*cusp ^ ^ ^*cross ^ j 54 

an isothermal ellipsoid lens with ellipticity e = 0.5, and quote all lengths in 
units of Rein¬ 


cusp lenses. Our archetypal cusp lens does not quite reach 
the asymptotic values because the source lies a finite distance 
from the cusp {d^ is 0.46 rather than 0), but it does confirm 
the basic reasoning. 

Examining Rfoid for widely-separated image pairs in this 
way does not really tell us about small-scale structure, be¬ 
cause we are no longer restricted to short length scales. Nev¬ 
ertheless, it is still helpful for obtaining a general understand¬ 
ing of the fold relation. 

5. THE FOLD RELATION IN REALISTIC LENS POTENTIALS 

While the ideal fold relation ^ 0 is completely gen¬ 
eral, it is only valid when the source is extremely close to 
a caustic. In realistic situations, the better approximation 
Rfoid =^foidtIi depends on the source position and properties 
of the lens potential. In Paper I, we explicitly showed that the 
properties of the lens potential affecting the cusp relation are 
ellipticity, low-order multipole modes, and tidal shear. Here, 
we simply define a “realistic smooth lens” to be one that has 
these angular structures. (See m for more discussion.) Un¬ 
fortunately, the ellipticity, multipole moments, and shear in 
individual lenses cannot be observed directly. Ellipticity and 
multipole modes in the lens galaxy light may be measurable, 
but for lensing we need the properties of the mass. The mass 
properties could be constrained with lens m odeling (with per - 
haps the best example being the analysis bv lYoo et al.ll200-5l) . 
but we seek to avoid model dependence as much as possible. 
Instead, our approach is to adopt observationally motivated 
priors on the distribution of ellipticity, multipole modes, and 
shear and use Monte Carlo simulations to derive probability 
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distributions for /?foid for a realistic lens population. 

5.1. Methods 

The simulation methods are the same as in Paper I, so we 
review the main points here and refer the reader to that paper 
for further details. We consider only isothermal radial pro¬ 
files (S oc R~^ ) for the simulated galaxies, because in Paper 
I we showed that local analyses of the lens mapping are not 
very sensitive to changes in the radial profile. For the angular 
structure, we consider ellipticity as well as additional octopole 
modes (m = 4 multipole perturbations). To model populations 
of early-type galaxies, we draw the ellipticities and octopole 
moments from measurements of isophote shapes in observed 
samples of early-type galaxies.Even if the shapes of the 
light and mass distributions are not identical on a case-by-case 
basis, it seems reasonable to think that th eir distributions may 
be similar (see iRusin & Tegmarkll200TI) . Indeed, the distri¬ 
bution of isodensity contour shapes in simulated merger rem¬ 
nants is very similar to the observed distobutio n of isophote 
shapes (iHevl et al.ll994t^aab & Burkerd20()^ . We use three 
different observational samples, because they have different 
strengths and weaknesses and allow a check for systematic 
effects: 

• |j0rgensen, Franx & Kjaergaardl Jl995ll report elliptici¬ 
ties for 379 E/SO galaxies in 11 clusters, including 
Coma. Their ellipticity distribution has mean e = 0.31 
and dispersion cje = 0.18. They do not report octopole 
moments. 

• iBender et alJ dl989t) report ellipticities and octopole 
moments for 87 nearby, bright elliptical galaxies. Their 
ellipticity distribution has mean e = 0.28 and dispersion 
CTc = 0.15, while their octopole moment distribution has 
mean dn = 0.003 and dispersion = 0.011. 

• ISaglia, Bender & Dresslerl (Il99.3h report ellipticities 
and octopole moments for 54 ellipticals in Coma. Their 
ellipticity distribution has e = 0.30 and ag = 0.16, while 
their octopole moment distribution has 04 = 0.014 and 

= 0.015. 

The ellipticity and octopole distributions for the three samples 
are shown in Fig. 6 of Paper I. All three samples are limited 
to low-redshift galaxies (by the need for good resolution to 
measure isophote shapes). We must assume that the distribu¬ 
tions are reasonable for intermediate-redshift galaxies as well, 
which seems plausible if major mergers involving ellipticals 
are infrequent. 

For the external shear amplitude, we adopt a lognormal dis¬ 
tribution with median 7 = 0.05 and dispersion = 0.2 dex. 
This is consistent with the distribution of shears expected 
from the environments of early-type galaxies, as estimated 
from A -body and semianaly t ic siin ulations of galaxy forma¬ 
tion by iHolder & Schechted ll2003l) . It is broadly consistent 
shears required to fit observed lenses, 
use a halo model calculation to sug¬ 
gest that the median shear should be more like 7 = 0.03. How¬ 
ever, the smaller median shear is not very compatible with 
the shears required to fit observed four-image lenses (e.g., 

*** Most lenses are produced by early-type galaxies. Among the four-image 
lenses, the only known spiral lens galaxy is in Q2237-I-0305, and even there 
the images are most affected by the spheroidal bulge. This lens is not very 
important for our analysis, because iUs_a_crossJenSj_and because it is already 
known to exhibit microlensing (e.g.,|Wozni^_etalJ200C|). 


with the distribution of 



iKeeton et al.lll99'^ . Furthermore, if we want to determine 
how much /?foid can deviate from zero for smooth lens po¬ 
tentials, then the conservative approach is to adopt the larger 
median shear. We assume random shear directions. 

For each combination of ellipticity, octopole moment, and 
shear'* * *** we choose random sources with de nsity ^ lO^/^lil , 
solve the lens equation using the software bv iKeetonI J 2001 ll . 
and compute {di^d 2 ,Rfo\i) for each minimum/saddle image 
pair. For each input distribution, we examine a total of ~ 10® 
mock four-image lenses. Note that choosing sources with uni¬ 
form density in the source plane has two important conse¬ 
quences. First, it ensures that each lens potential is automat¬ 
ically weighted by the correct lensing cross section. Second, 
it means that we neglect magnification bias, which would fa¬ 
vor lenses with higher amplifications, and therefore give more 
weight to sources near the caustics that produce small devia¬ 
tions from the ideal fold relation. We therefore believe that ne¬ 
glecting magnification bias is the conservative approach when 
seeking to understand how large the deviations can be for 
smooth lens potentials. 

5.2. First results 

We use our ensemble of mock image configurations to ex¬ 
tract the probability distribution for /Jfoid at fixed values of 
di and Conceptually, this is like going to the appro¬ 

priate point in the {di,d 2 ) plane of Figure 3e-f and reading 
off Rfoid, except that we now consider a large ensemble of 
lens potentials. To illustrate how we use these distributions. 
Figure 5 compares the value of /?foid for each image pair in 
our archetypal fold, cusp, and cross lenses to the appropri¬ 
ate conditional probability distribution p{Rfo\i\di,d 2 ) derived 
from the Monte Carlo simulations. If the observed value lies 
outside the predicted distribution, then we conclude that the 
image pair is inconsistent with lensing by a realistic popula¬ 
tion of smooth lens potentials. It is reassuring to see that our 
archetypal lenses (which were generated with a smooth lens) 
are indeed found to be consistent with lensing by a smooth 
potential. 

We can observe some of the general features identified in 
34.31 Many of the distributions are bimodal, and some of 
those have two completely disjoint peaks. This is because 
given values of di and d 2 can correspond to multiple source 
positions that yield different /?foid values (see Fig. 3). In 
the fold lens, the pair involving two fold images (AB) has 
Rfoid ~ 0. The two pairs involving one fold and one non¬ 
fold image (CB and AD) each have |/?foid| large; they do not 
have /?foid —> ± 1 because the source sits a finite distance from 
the caustic, but the general trend that the fold pair has small 
|/^foid| while the two fold/non-fold pairs have large |/?foid| is 
confirmed. In the cusp lens, two pairs have Rfo^ « ±1/3 
while the other two have |/?foid| large. Again, the reason 
that the peaks in the AD and CD pairs do not actually reach 
Rfoid ^ ± 1 is because the source sits a finite distance from the 
caustic. Finally, in the cross case all four pairs have similar 
Rfoid distributions — identical in the case of a symmetric cross 
— which are centered at some positive value but fairly broad. 
The consistency between our general analytic arguments and 
our detailed Monte Carlo simulations is reassuring, and indi- 

Note that we need not specify the galaxy mass, because for an isothermal 
lens the mass merely sets the length scale Rejn, and we can always work in 
units such that Rein = 1. 

Strictly speaking, to accommodate our finite sampling of phase space, 
we consider all image configurations within ±0.05 of the specified d[ and d 2 
values. 
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Fig. 5.— Probability distributions for i?foid- On the left we show the image configurations, with lensed images (A-D) marked in blue and the lens galaxy (G) 
marked in red. The four columns then show /?foid for the different image pairs, sorted by increasing from left to right. (We adopt the convention of naming 
each pair such that the first letter indicates the minimum image while the second letter indicates the saddle.) The green vertical lines mark the actual values for 
our archetypal lenses. The black curves show the predicted distributions for realistic lens populations, with solid, dashed, and dotted curves showing results for 
Monte Carlo simulations based on the Jprgensen, Bender, or Saglia galaxy samples, respectively. The predicted distributions are normalized to unit area. 


cates that we have obtained new, deep insights into the fold 
relation. 


6. APPLICATION TO OBSERVED LENSES 

We are finally ready to examine the fold relation fo r ob - 
served four-image lenses. We summarize the data here 1 36. U . 
and provide more details in Appendix B. Our mai n int erest 
for the fold relation is of course fold image pairs ( 36.2> . but 
it is also inte resting to consider the other ima ge pa irs in fold 
le nses ( 36.3> . as well as image pairs in cusp ( 36.4> and cross 
1 36.5> lenses. 


6.1. Summary of the data 

Table 1 lists the values of d\, dx, Ran, and /Jfoid for all 
minimum/saddle image pairs in 22 known four-image lens 
systems,'^ and Appendix B provides some comments about 
the data. Most available flux ratio data come from broad¬ 
band optical/near-infrared images or radio continuum obser¬ 
vations. We consider separate optical and radio /?foid values, 
since they correspond to very different source sizes and there¬ 
fore provide different information about small-scale struc¬ 
ture. We also consider any other flux ratio da ta that are 
available; in the mid-infrared for Q2237H-0305 jjAgol^]^^ 
2000j) as well as PG 1115H-080 and B1422H-231 dChiba et al. 
2005 1, and the optical broad emission lines for HE 0435-1223 
dwisotzki et al.l2003l) . WFI 2033-4723 (iMorgan et al.l2004 . 
and SDSS 0924H-0219 (iKeeton et al.l20olL 
We need the Einstein radius Ran to normalize di and d 2 . 
This must be determined from lens models, but it is quite 
robust and not very dependent on the choice of model (e.g., 
lKochane^ll99U ICohn et Tll2001h . We treat the main lens 

We do not analyze 0047—2808, as discussed in Appendix B. 


galaxy as an isothermal ellipsoid with surface mass density 


S(r, 6 ) _ _^ein_ 

^crit 2r[l-ecos2(6»-0,)]'/2 ’ 


(4) 


where Sent is the critical surface density for lensing, e is re¬ 
lated to the axis ratio q of the galaxy by e = +q^), 

and is the position angle of the galaxy. Few four-image 
lenses can be fit by a pure isothermal ellipsoid model, be¬ 
cause elliptical lens galaxies tend not to be isolated. In most 
cases, modeling the environmental contribution to the lens po¬ 
tential as an external shear provides an e xcellent fit to the 
image positions (e.g.. iKeeton et al]ll997h . The exceptions 
are HE 0230-2130, MG J0414H-0534, RX J0911H-0551, and 
B1608H-656, each of which is known to have a satellite galaxy 
near the main lens g alaxy that must be included in order to fit 
the image position s (IWisotzki et al.l l 999H.Schechter & Moorel 

fT9^lKoonmanset al.l2003H) . We treat the satellite galaxies 

as isothermal spheres. We stress that when fitting the models 
to determine Rein, we use only the relative positions of the im¬ 
ages and the lens galaxy as constraints; it is not necessary to 
use the flux ratios as model constraints. 

Figure 6 shows how the observed lenses populate the 
(d \, d 2 ) plane. Although we have labeled them as folds, cusps, 
and crosses, in fact there is no sharp distinction between the 
fold and cross samples. SDSS 0924H-0219 and B1933H-503, 
which have di/Ran ~ 0.9, could arguably be r elabe led as 
crosses, although we choose not to do so (see 36.2t . The 
smooth transition simply reflects the fact that there are no 
sharp boundaries between different four-image configurations 
in the source plane. 

In the figure there is a particular region occupied by simu¬ 
lated lenses (the grayscale), but it is specific to an isothermal 
ellipsoid lens with ellipticity e = 0.5. Varying the ellipticity 
and/or adding shear would move the upper edge so that the 
region could accommodate the other cross lenses. The two 
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Fig. 6.— Colored points mark the locations of known four-image lenses in 
the plane of di /Ran and d 2 lRan- The color indicates the visual classification 
as a fold, cusp, or cross, while the point size indicates the value of Rfoid. To 
help guide the eye, the grayscale shows i?foid for an isothermal ellipsoid lens 
with ellipticity e — 0.5 (from Fig. 3e). 
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fold lenses at < 12 /Rein ~ 2 are a different story, though. These 
are HE 0230-2130 and B1608-1-656, each of which has two 
lens galaxies. Turning this around, we may say that observed 
lenses that are outliers in the {di,d 2 ) plane are likely to have 
complex lens potentials containing multiple galaxies. 

6.2. Fold image pairs 

We now examine the fold relation by comparing the ob¬ 
served fJfoid values to the distributions expected for a realis¬ 
tic galaxy population. Figures 7-8 show the comparisons for 
the 12 fold lenses,'^ arranged in order of increasing d^ //?em- 
While there is a tremendous amount of information here, the 
discussion in ^helps us pick out the main trends. First let 
us consider the various predicted distributions. When d*/Rein 
is small, for the fold pair (Column 1) is predicted to lie 
in a very narrow range near zero. This is the fold relation in 
its familiar form. At the same time, two other image pairs 
have distributions that feature two narrow and well-separated 
peaks (compare the top row of Fig. 5), while the fourth pair 
has a broad distribution with no particular center. As d* /Rein 
increases, the distribution for the fold pair broadens while 
the two peaks for the next closest pair (Column 2) tend to 
move closer together. HE 0230-2130 and B1608H-656 buck 
these trends, for a simple reason: they have two lens galaxies, 
so they have configurations that are rare in our Monte Carlo 
simulations,'^ and that leads to narrow and unusual predicted 
/Jfoid distributions. 

For most image pairs, there are no tremendous differences 
between the /Jfoid distributions from the three different sim¬ 
ulations (based on the Jprgensen, Bender, or Saglia galaxy 

Note that we now include B0712-H472 among the folds, even though we 
considered it a cusp in Paper I. Both classifications seem valid, depending on 
one’s purpose. The close pair AB can be considered a fold, while the close 
triplet ABC can be considered a cusp. The source must lie close to the caustic 
in a region not far from a cusp. For the purposes of this paper, it is a fold. 

In fact, the Jprgensen and Saglia simulations do not contain any con¬ 
figurations with the same rfi and di values as the BD pair in B1608-H656, to 
within the sampling resolution of our Monte Carlo simulations. 


samples; also see Fig. 9). This gives us confidence that our 
conclusions are robust in the sense of not being very sensitive 
to the simulation input data. 

Now we turn to the observed values of /Jfoid- Many of them 
lie within the predicted range, so there is no obvious violation 
of the fold relation. The outliers are as follows: 

• B0712H-472: The optical data grossly violate the fold 
relation, but the radio data do not (as in the cusp rela¬ 
tion; see Paper I). The wavelength dependence suggests 
that the optical anomaly is caused by microlensing. 

• B1555H-375: The radio data violate the fold relation at 
high confidence, as shown more clearly in Figure 9. 

• PG 1115H-080: The optical value of /?foid differs from 
the predictions at 99.2% confidence for the Jprgensen 
and Saglia simulations, and 96.1% confidence for the 
Bender simulations. The flux ratios can in prin- 
ciple be fit using large- amplitude multipole modes 
dKochanek & Dalalll2004l) . but such modes are incon¬ 
sistent with the Einstein ring image of the quasar host 
galaxy dYoo et al.l2005l) . In other words, it appears that 
this lens is anomalous, but the model-independent evi¬ 
dence is not quite as secure as for the other anomalies. 
The mid-infrared value of /?foid differs from the optical 
value and agrees well with the predicted distribution, 
suggesting that the optical anomaly is created by mi¬ 
crolensing. 

• SDSS 0924H-0219: Although AD is not a particularly 
close pair (making the predicted /Jfoid distribution fairly 
broad), image D is so faint that there is a gross viola¬ 
tion of the fold relation in both broad-band and broad 
emission line flux ratios. Differences between the flux 
ratios plus time variability suggest the presence of mi¬ 
crolensing dKochanekl2004btlKeeton et alJ200-'% . 

• B1933H-503: Although the 4/3 image pair is not partic¬ 
ularly close, image 4 is so bright that there is a clear 
violation of the fold relation in the radio data. 

HE 0230-2130 and B1608H-656 deserve mention because 
each has two lens galaxies. In HE 0230-2130, the observed 
Rfoid value shows images A and B to be more similar than 
expected for simple smooth lens potentials. In B1608H-656, 
the observed /?foid value agrees with the predictions for sim¬ 
ple lenses, which seems coincidental. These systems show 
that violations of the fold relation can reveal the lens potential 
to contain structure that is complex but not necessarily small- 
scale. That raises the question of whether any of the other 
anomalies could be caused by something large like a second 
galaxy. Probably not: in both of these systems the second 
galaxy was already known from di rect observations and anal¬ 
yses of the image positions (see 36.H . In other words, it is 
difficult for a massive second galaxy to escape notice. We 
therefore believe that the “second galaxy” hypothesis is not a 
valid explanation for most flux ratio anomalies. 

To summarize, our analysis of the fold relation reveals 
two flux ratio anomalies that were already known from vio¬ 
lations of the cusp relation (B0712H-472 optical, and SDSS 
0924H-0219 optical). It also reveals strong new evidence 
for fold flux ratio anomalies in B1555H-375 (radio) and 
B1933H-503 (radio), plus good but slightly less strong evi¬ 
dence for a fold anomaly in PG 1115H-080 (optical). In ad¬ 
dition, a violation of the fold relation in HE 0230-2130 (opti- 
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Fig. 7.— Observed and predicted Sfoid values, for six of the known fold lenses. The black curves show the predicted distributions for realistic lens populations, 
as in Fig. 5. The vertical colored lines show the observed values a nd their uncertaint ies, with green indicating optical/near-infrared data and red indicating radio 
data. For PG 1115+080, the blue lines indicate mid-infrared data JQiiba_etalJ2003). All data are listed in Table 1. The lenses are sorted by increasing 4j* /Rein 
from top to bottom. For each lens, the image pairs are sorted by increasing di/Rein from left to right. 


cal) is presumably due to the presence of a second lens galaxy 
in that system. 

While these specific conclusions are valuable, there are 
some important general lessons as well. First, even the closest 
observed fold image pairs have predicted /Jfdd distributions 
with a finite width. Therefore, a non-zero Rfoid value in the 
range Rfou ~ 0-0.2 cannot generally be taken to indicate a 
flux ratio anomaly. As a rule of thumb, when d\/Rem ^ 0.4 
it does appear that a value > 0.2 is likely to indicate an 
anomaly, although we caution that this is just a rule of thumb 
and a full analysis of the predicted /Jfdd distribution must be 
done to reliably identify an anomaly. 

The importance of the full analysis becomes clear when 
we consider PG 1115+080, WFI 2026-4536, and SDSS 
1004+4112. These three lenses have similar configurations 


with dx/Rein ~ 0.5, and (curiously enough) they all have 
^foid ~ 0.2. Yet one is anomalous (PG 1115+080), while 
the other two are fully compatible with the predicted distri¬ 
butions. What’s more, the predicted distributions for WFI 
2026-4536 are bimodal and qualitatively different from those 
for the other two lenses, even though all three image configu¬ 
rations are visually similar. These lenses teach the lesson that 
identifying fold flux ratio anomalies is not a simple matter of 
finding a close pair of images and asking whether Rfo\d 0. 
The distance from /?foid —> 0 that is needed to provide strong 
evidence for an anomaly depends in a complicated way on 
various properties of the lens potential that cannot be directly 
observed. Only a full and careful analysis of the fold relation 
can handle these issues. 

We conclude that violations of the fold relation can be used 
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Fig. 8.— Similar to Fig. 7, but for the remaining six known fold lenses (again sorted by rfj* /Scin)- For SDSS 0924+0219 and WFI 203 3-4723, the vertical 
green lines show data from broad-band optical flux ratios, while the vertical blue lines show data from optical emission line flux ratios IMorgan et alJI20(M 
iKeetnnTalilOl)^ . 


to find flux ratio anomalies in a fairly model-independent way. 
However, that analysis is more subtle than was previously re¬ 
alized. It is necessary to know not only the separation d\ be¬ 
tween the two images, but also the distance d 2 to the next 
nearest image, and to account for the finite width of the /?foid 
distribution expected for smooth lenses. 

Finally, it is worthwhile to comment that all twelve of the 
fold image pairs have /?foid values that are positive or con¬ 
sistent with zero. An important prediction to emerge from 
theoretical studies is that small-scale structure (either dark 
matter clumps or stars) should tend to amplify minimum im¬ 
ages and/or suppress saddle images (iMet calf & Madau|2QQlt 
Schechter & Wambsganssll2002t iKeetoniboOSi: lBrada£ et al.l 

20041) . Since either possibility would make /?foid > 0, see¬ 
ing only non-negative values is certainly consistent w ith the 
substructure hypothesis (see iKochanek & Dalaill2004l) . It is 


inconsistent with non-gravitational explanations of flux ratio 
anomalies (such as extinction or scattering), because those 
should affect minimum and saddle images in the same way. 
What is less clear is whether lumpy substructure is the only 
thing that can explain the asymmetry between minima and 
saddles, or whether small-scale but smooth structure is a vi¬ 
able alternative. Our analysis does offer an intriguing hint: 
nearly all the weight in our predicted /Jfoid distributions lies 
at /?foid > 0, which indicates that even smooth, global features 
like ellipticity and shear affect minima and saddles differently. 
Still, it is not clear whether smooth features can explain the 
further asymmetry that all the anomalous /?foid values exceed 
the predictions. The minimum/saddle asymmetry appears to 
be a very promising probe of small-scale structure, but much 
more study is clearly called for. 
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Fig. 9. — (a) Close-up of the AB panel for B1555-I-375 from Fig. 7. 
The red line shows the observed radio value for i?foid, with its uncertainty 
(see Table 1). The solid, dashed, and dotted curves show the predicted Rfoid 
distributions for Monte Carlo simulations based on the Jprgensen, Bender, 
and Saglia galaxy samples, respectively. For comparison, the hatched region 
shows the distributions for an isothermal ellipsoid with ellipticity e = 0.5. (b) 
Similar to panel a, but showing cumulative probability distributions. 


6.3. Other image pairs infold lenses 

We can also consider the /?foid values for the other image 
pairs in fold lenses, although we must be care ful a bout how 
we interpret them. As discussed in 31 and 15.21 there are 

some useful general properties of the fold relation for these 
other image pairs. For example, the predicted /?foid distribu¬ 
tion for a pair comprising a fold image and a non-fold image 
has two narrow peaks, one positive and one negative. While 
we are not aware of a simple way to predict the specific val¬ 
ues, it seems from Figures 7-8 that they depend mainly on 
dl/R^in- 

There are several image pairs for which the observed /?foid 
value lies far from the peaks in the predicted distributions; 
CB in B0712H-472, CB in B1555H-375, AC and DC in SDSS 
1004H-4112, and AjC and BC in WFI 2033-4723.'® The rel¬ 
atively large distance between the images in each pair pre¬ 
vents us from concluding that the discrepancies reveal “small- 
scale” structure in the lens. We can still conclude, though, that 
each pair is inconsistent with smooth lens models containing 
moderate ellipticities, octopole moments, and shears. Indeed, 
two of these lenses are already known to have complex poten- 
tials. SDSS 1004- H4112 is produced by a cluster of galaxies 
dOguri et al.l2004]) . while WFI 2033-4723 appears to lie in a 
group of galaxies with at least six perturbers lying within 20" 
of the main lens galaxy (iMorgan et al.l200^ . 

The situation seems different for the CB pair in PG 
1115H-080 and the CD pair in B0128H-437. In these cases the 
observed values lie in the tail of the predicted distributions, at 
around the 1% probability level. It may be that having two 
“rare” values among 88 image pairs is statistically unsurpris¬ 
ing, although it is hard to know how to quantify that possibil- 

Note that in WFI 2033—4723 the differences between the optical contin¬ 
uum flux ratios and t he emission line flu x ratios are interpreted as evidence 
for microlensing (see|Morgan_et^J200^. 


ity because the 88 pairs are not all independent. Alternatively, 
it may be that modest changes in the assumed distributions of 
ellipticity, octopole moment, and shear could raise the tail of 
the predicted /?foid distribution enough to make the observed 
values seem less unusual. 

Overall, we conclude that gross discrepancies between ob¬ 
served and predicted values for the “other” pairs in fold 
lenses indicate complex structure in the lens potential. It is 
not necessarily small-scale structure, but it is still interesting 
and worth studying with detailed lens models. 

6.4. Cusp lenses 

A cusp conhguration has a close triplet of images that can 
be t hought of as two close pairs. As discussed in § EH and 
15.21 we exspect the predicted fJfoid distribution for each pair to 
have two narrow and well-separated peaks. This constitutes 
a sort of “fold relation” for cusp lenses, which might help us 
better understand flux ratio anomalies in these systems. 

Figure 10 compares the observed and predicted /?foid distri¬ 
butions for the four observed cusp lenses. Image pairs like BA 
in RX J0911H-0551 and BA in RX J1131-1231, which have 
observed Rfoid values lying between and far from the two pre¬ 
dicted peaks, appear to indicate anomalies. In one sense these 
conclusions are not new, because these anomalies had already 
been identified through violations of the cusp relation (see Pa¬ 
per I). However, the fold relation can help us determine which 
of the three images is most anomalous. In RX J0911H-0551, 
the violation of the fold relation is stronger in the BA pair 
than in the BC pair, so we infer that image A is probably the 
one most affected by small-scale structure. Similar reasoning 
leads to the conclusion that image B in RX J1131-1231 is the 
most anomalous. There could in fact be more than one per¬ 
turbed image (see lDobler & Keetonl2005l for examples among 
other lenses), but the important point is that the fold relation 
can suggest which image is most anomalous — a distinction 
that could not be made by the cusp relation. 

The CB pair in B2045H-265 illustrates a curious aspect of 
this analysis. For this lens, simulations without octopole 
modes (using the Jprgensen data) predict an Rfoid distribution 
consisting of two narrow peaks far from the observed values. 
However, in simulations that include octopole modes (using 
the Bender or Saglia data), there is a small but finite proba¬ 
bility for Rfoid to lie between the two peaks. In this case, the 
possibility that octopole modes may be present limits our abil¬ 
ity to declare that the fold relation is violated. This result is 
surprising because B2045H-265 shows a very strong violation 
of the cusp relation, even when octopole modes are consid¬ 
ered (see Paper I). The difference must be that the cusp re¬ 
lation considered three images simultaneously, while the fold 
relation considers them in two separate pairs. 

Finally, we remark that the fold relation does not indicate 
anomalies among three cusp images in B1422H-231. This 
is consistent with our conclusion from Paper I that generic 
magnification relations do not identify anomalies in this lens, 
even though det ailed lens modeling suggests that it is in¬ 
deed anomalous (iMao & Schneideil1998HBradac et al.l2n0^ 
iMetcalf & Zhaol2002HDobler & Keetonll2005h . iBradaS et al.l 

J2002112004]) claim that the challenge for smooth lens models 
is not just the relative brightne sses of images A, B, and C, but 
also the faintness of image D. iDobler & Keeton! (l2005l) were 
able, though, to And an acceptable model under the hypothesis 
that only image A is perturbed by small-scale structure. We 
conclude that the nature of the anomaly in B1422H-231 is not 
yet clear, and generic magnification relations are not adequate 
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Fig. 10.— Similar to Fig. 7, but for the known cusp lenses. Again, the vertical green lines sh ow optical/near-in frared values of Sfoid, while the vertical red 
lines show radio values. For B1422+231, the vertical blue lines show mid-infrared values of i?foid |Qiiba_e^alJ2003). 


for understanding this system. 

6.5. Cross lenses 

The image pairs in cross lenses are not close pairs, but for 
completeness we still consider them in the context of the fold 
relation. Figure 11 compares the observed and predicted fJfoid 
values for the six known cross lenses. The predicted distri¬ 
butions are all broad and c enter ed at some positive value of 
^toid (also see Fig. 5 and 35.2> . There are several cases of 
disagreement, which can be understood as follows. First, in 
Q2237+0305 microlensing pertur bs the flux ratios, and in fact 
causes them to change with time JWozniak et alJ2000l) . 

Next, in HE 0435-1223, HST 14113H-5211, and HST 
12531-2914, several of the observed Rma values lie to the left 
of the predicted distributions. However, our ellipsoid+shear 
lens models fit the flux ratios fairly well, provided that the 
shear is allowed to be moderately large (7 = 0.13 for HE 
0435-1223, and 7 ^ 0.3 for the other two). Such large shears 
are rare in the distribution used for our Monte Carlo simula¬ 
tions, which is why the predicted fJfoid distributions in Fig¬ 
ure 11 do not extend down to the observed values. They 
are not unreasonable, though, for lenses that lie in com¬ 
plex environments like groups or clusters, which is probably 
the case for all three of these lenses (s ee IWitt & MaoiriWI 
iFischer et alJll99^lMorgan et alJl^05^ . In other words, the 
discrepancies in Figure 11 for these three lenses indicate that 
there is “interesting” structure in the lens potential, but in this 
case it is probably structure in the environment of the lens as 
opposed to small-scale structure. 

Finally, in HST 14176H-5226 and HST 1413H-117 there are 


discrepancies between the data and predictions that are not 
fully explained by a large shear. The fold relation cannot pro¬ 
vide strong conclusions here, but it does suggest that these 
two systems deserve further study. 

A striking general feature of Figure 11 is that nearly all 
the weight in the predicted probability distributions lies at 
^foid > 0, while some of the observed/?foid values are negative. 
We have argued that several of the negative observed values 
can be explained by large shears, which are absent from our 
Monte Carlo simulations. Thus, as a rule of thumb it appears 
that a negative /?foid value in a cross lens may indicate that 
there is a significant environmental contribution to the lens 
potential. 

7. CONCLUSIONS 

When the source in a four-image gravitational lens sys¬ 
tem lies sufficiently close to a fold caustic, the two images 
that straddle the fold critical curve should be mirror im¬ 
ages of each other, and the dimensionless flux combination 
Infold = (Emin-Fsad)/(Emin+F;ad) should Vanish. A violation of 
this “fold relation” in an observed lens is thought to indicate 
that the lens galaxy has significant structure on scales smaller 
than the separation between the two close images. The fold 
relation may therefore join the cusp relation as an important 
model-independent method for identifying flux ratio anoma¬ 
lies that indicate small-scale structure. 

We have learned, though, that the fold relation is more sub¬ 
tle and rich than was previously realized. The ideal fold re¬ 
lation /?foid ^ 0 holds only when the source is asymptotically 
close to a fold caustic. In more realistic situations, we find 
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Fig. 11.— Similar to Fig. 7, but for the known cross lenses. For 02237+0 305, the vertical blue l ines show mid-infrared values of Rfo\d lAgol et all200(11 : also, 
the green lines bend to represent the time variability in the op tical flux ratios IWoznia k et alJ2003) . with time running vertically. For HE 0435—1223, the vertical 
blue lines indicate data from optical emission line flux ratios iWisotzkr^ alJ2()()3l> . 


^foid oc cx di where u is the distance of the source from the 
caustic, while di is the distance between the two close images. 
In other words, /Jfoid goes to zero fairly slowly as the source 
approaches the caustic, which means that Rfou ^ 0 might just 
indicate that the source sits a finite distance from the caustic. 
(For comparison, the cusp relation has a more rapid depen¬ 
dence /?cusp oc d^\ see Paper I.) If we seek to use the fold rela¬ 
tion to identify flux ratio anomalies that indicate small-scale 
structure, then we must understand how much /?foid can devi¬ 
ate from zero just because of the finite offset of the source. 

This is where we find our most startling result: Rfou is de¬ 
termined not just by the distance of the source from the caus¬ 
tic, but also by the location of the source along the caustic. If 
we write /?foid = Afoid^fi -I- ..., then the coefficient Afoid varies 
enormously around the caustic. The problem is that the lo¬ 
cation of the source along the caustic, and hence the value of 


Afoid, is not directly observable. Consequently, it is no simple 
matter to say how large Rm^ must get before we can infer the 
presence of small-scale structure. Fortunately, the placement 
of the source along the caustic is encoded in the image con¬ 
figuration: not in the separation di between the close pair of 
images, but rather in the distance d 2 to the next nearest im¬ 
age. (For example, a source near a fold but not near a cusp 
leads to di ^ d 2 ^ R^in, while a source near a cusp leads to 
d\ ^ d 2 ^ Rein-) We may still be able to predict the range 
of /?foid possible for a smooth lens potential, but only if we 
consider d 2 as well as lii. 

This general understanding allows us to develop a gen¬ 
eral method for using the fold relation to search for flux ra¬ 
tio anomalies in real lens systems. We postulate a reason¬ 
able and realistic population of smooth lens potentials con¬ 
taining ellipticity, octopole modes, and tidal shear, and use 
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Monte Carlo simulations of four-image lenses produced by 
these lens potentials to derive the conditional probability dis¬ 
tribution p{Rfo\d\dl^d 2 ) for /Jfoid at hxed d\ and d 2 (strictly 
speaking, hxed di/Rein and lia/^ein)- We can then compare 
the observed value of Rfon for a real lens to the correspond¬ 
ing predicted distribution to determine whether the data are 
consistent with lensing by a smooth potential. In making our 
predictions, we actually consider three different galaxy popu¬ 
lations in order to understand how our results depend on as¬ 
sumptions about what constitutes a “reasonable and realistic” 
lens population. 

The fold relation applies most directly to the close pair 
of images in a lens with a fold conhguration. Among the 
12 known fold lenses, we hnd evidence for hve violations; 
the optical (but not radio) huxes in B0712H-472; the opti¬ 
cal huxes in SDSS 0924H-0219; the optical huxes in PG 
1115H-080; the radio huxes in B1555H-375; and the radio 
Huxes in B1933H-503. The optical anomalies in B0712H-472 
and SDSS 0924H-0219 were already known from violations of 
the cusp relation (see Paper I), but it is valuable to see them 
identihed by the fold relation as well. The optical anomaly in 
PG 1115H-080 is not quite as secure as the others; the conh- 
dence level is 99.2% for two of our three sets of predictions, 
but only 96.1% for one that includes fairly strong octopole 
modes. Detailed lens modeling of PG 1115H-080 suggests that 
multipole modes cannot provide an acceptable exp l anation of 
the H ux ratio anomaly dKochanek & DalaH l2004t lYoo et all 
l2005h . and while that conclusion is more model-dependent 
than ours, it does suggest that PG 1115H-080 is indeed anoma¬ 
lous. The radio anomalies in B1555H-375 and B1933H-503 are 
newly revealed by the fold relation. 

We believe that fold hux ratio anomalies provide robust and 
model-independent evidence for small-scale structure, for two 
reasons. First, the identihcation of the anomalies involves a 
local analysis of the lens mapping, and thus relies only on lo¬ 
cal properties of the lens potential. This is precisely what we 
want in an analysis aimed at revealing small-scale, local struc¬ 
ture. Second, we have explicitly shown that (apart from PG 
1115H-080) our conclusions do not change if we modify the 
parameter distributions that dehne our realistic galaxy popu¬ 
lation. 

Based on our detailed analyses of individual fold lenses, 
we can extract a few rules of thumb. Since the sources in 
real lenses always lie a hnite distance from a caustic, fJfoid 
values in the range 0 < /Jfoid ^ 0.2 are predicted to be quite 
common and probably do not indicate Hux ratio anomalies. 
When the separation between the two close images is small 
{di/Re.in ^ 0.4), the dependence on d 2 is not very strong; all 
that really matters is having d 2 be large enough for the image 
conhguration to be identihed as a fold. In this case, it appears 
that large values /?foid > 0.2 can reveal candidate anomalies. 
However, the fact that only one of four observed lenses with 
^foid ~ 0.2 is anomalous provides a strong reminder that a 
full and careful analysis of the fold relation must be done be¬ 
fore drawing conclusions about anomalies. Finally, as di in¬ 
creases, so too does the value of fJ^id required to indicate an 
anomaly, and rules of thumb about the fold relation cease to 
be valid. 

One hnal rule of thumb is that our smooth lens po¬ 
tentials almost always predict /Jfoid > 0 for fold image 
pairs, indicating an asymmetry such that minimum im¬ 
ages are generally expected to be brighter than saddle im¬ 
ages in fold pairs. This point probably has implications 
for the prediction that substructure affects minima and sad¬ 


dles dif fpently, tending to amplify minima but suppress 
saddl es dMetcalf & MadaiJ l2001t ISche chter & WambsganssI 
1201^ lK^nll2003t iBradac et al.li2004l) r and for the obser¬ 
vation that anomalous minima seem to be too bright while 
anomalous saddles seem to b e too faint (iKochanek & Dalai 
120041 [Pobler & Keetonl20ri^ . However, these issues are not 
yet fully understood, and further study is needed. 

Our full analysis of the fold relation also allows us to ap¬ 
ply it to the two close image pairs in a cusp lens. This ap¬ 
plication is more subtle because the analysis underlying the 
fold relation breaks down near a cusp caustic. Nevertheless, 
the predicted /?foid distribution for smooth lenses is bimodal 
with two narrow and well separated peaks, which constitutes 
a sort of fold relation that can be used to evaluate observed 
/?foid values. We hnd that applying this fold relation to cusp 
lenses does not reveal any new anomalies beyond those that 
were identihed by the cusp relation (Paper I). However, it may 
help us understand which of the three images is anomalous (a 
distinction that the cusp relation cannot make). For exam¬ 
ple, it appears that the strongest anomaly in RX 1091 Ih- 0551 
is probably in image A, while the strongest anomaly in RX 
11131-1231 is probably in image B. We take these conclu¬ 
sions less as dehnite statements and more as interesting sug¬ 
gestions to be examined with detailed lens models. One curi¬ 
ous qualitative result is that, in cusp lenses, the fold relation 
appears to be more sensitive than the cusp relation to octopole 
modes. This fact limits our ability to hnd a clear violation of 
the fold relation in B2045H-265, even though this lens has a 
very strong violation of the cusp relation. 

Finally, we can also use our full understanding to apply 
the fold relation to all minimum/saddle pairs in all four im¬ 
ages lenses, regardless of how close the pairs are. We must 
be very careful to remember that when di > Rein we are no 
longer performing a local analysis of the lens mapping, so 
we cannot claim to draw any model-independent conclusions 
about small-scale structure. Nevertheless, it is still interesting 
to determine which lenses seem to be inconsistent with lens¬ 
ing by an isothermal ellipsoid perturbed by octopole modes 
and moderate shear. Among fold lenses that do not have fold 
anomalies, we hnd that B0128H-437, HE 0230-2130, SDSS 
1004H-4112, and WFI 2033-4723 all have discrepancies be¬ 
tween the data and predictions for other image pairs.In 
three of these cases (SDSS 1004H-4112, HE 0230-2130, and 
WEI 2033-4723), the discrepancies are (presumably) caused 
by complex structure in the environment of the main lens 
galaxy. Einally, each of the six known cross lenses has at least 
one discrepant image pair. In Q2237H-0305 the discrepancy is 
caused by microlensing. In three others (HST 12531-2914, 
HST 14113H-5211, and H1413 h- 117) it may be attributed to 
a large shear from a complex lens environment. Again, we 
emphasize that discrepancies in the fold relation for large- 
separation image pairs cannot be taken as strong evidence for 
small-scale structure. However, they can suggest that the lens 
potential has some interesting and complex structure that de¬ 
serves further study. 

At this point it is worthwhile to review the lenses in 
which violations the cusp and fold relations provide model- 
independent evidence for small-scale structure in the lens po¬ 
tential; 

• Among four known cusp lenses, there are three anoma- 

Actually, the discrepancy in HE 0230-2130 is seen in the fold pair, but 
it can be attributed to the presence of a second lens galaxy so we prefer to 
discuss it here rather than among the fold flux ratio anomalies. 
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lies; RX J0911+0551 (optical), RX J1131-1231 (opti¬ 
cal), and B2045+265 (radio). 

• Among 12 known fold lenses, there are five anomalies; 
B0712+472 (optical but not radio), SDSS 0924+0219 
(optical), PG 1115+080 (optical), B1555+375 (radio), 
andB1933+503 (radio). 


There may be other anomalies that are not identified by a 
generic analysis, but that are reveale d by detailed lens model- 


ing; B1422+231 is a prime example i 

Mao & Schneideil 19981; 

Bradac et alj|2002l iMetcalf & Zhaol 

20021 iDobler & KeetonI 

20051). Moreoyer. there may be systems among the “dis- 


crepant” lenses mentioned above that in fact contain small- 
scale structure; a good example is Q2237+0305, whose 
time variable discre pancies are caused by microlensing 
dWozniak et alJ[2000l) . In other words, our accounting rep¬ 
resents a strict lower bound on the number of lenses with flux 
ratio anomalies caused by small-scale structure — and makes 
it eminently clear that such anomalies are quite common. 

Interpreting these anomalies to place constraints on the na¬ 
ture of the implied small-scale structure involves many con¬ 
siderations that are beyond the scope of this paper. No analy¬ 
sis of single-epoch, single-band photometry can determine the 
scale of the structure required to explain flux ratio anomalies, 
beyond the idea that it must be smaller than the separation 
between the images. Time variability (as in Q2237+0305) 
or differences between optical and radio flux ratios (as in 
B0712+472) may indicate microlensing, although even then 
a much more detailed an alysis is required to determine the 
microlensing scale (e.g.. iKochanekI l2004al and references 
therein). Absent such data, it is impossible for any analysis 
of broad-band photometry in individual lenses to robustly dis¬ 
tinguish between microlensing, millilensing, or intermediate- 
scale phenomenon. All three possibilities are interesting, but 
they have very different implications for astrophysics. 


Fortunately, there are excellent prospects for obtain¬ 
ing additional data that can help distinguish between the 
different hypotheses. Even apart from time variability, 
comparisons between optical continuum and broad-band 
flux ratios can establish the scale of the small-scale struc¬ 


ture dM oustakas & MetcalS l2003t IWisotzki et alJ l2003t 


Metcalf et alJ 120041 iMorgan et alJ 120041 IWayth et alJ 

2005al iKeeton et al.l l2005^. Showing th 
saddle images are affected differently 
structure might also establish the scale dlV 

at minima and 
by small-scale 

etcalf & Madaul 

20011 iSchechter & WambsganssI 120021 

KeetoiJ 120031 


Kochanek & Dalall 120041; lBrada£ et alJ l2f)0A Tfiese are 


several examples of the more general point that the size 
of the source quasar provides a scale in the problem that 
may help us determine the scale of structure in the lens 
dPohler & Keetonll2flfl.5ft . It is important to note that all of 
these approaches require significant effort to obtain, analyze, 
and interpret new data; studying all four-image lenses in this 
much detail is not feasible. It is therefore crucial to have a 
reliable way to identify lenses that warrant further study. The 
fold relation joins the cusp relation in providing precisely 
the realistic but robust method that we need for identifying 
flux ratio anomalies. As such, the two relations provide 
the foundation for studies of small-scale structure in lens 
galaxies. 


We thank Art Congdon and Greg Dobler for helpful discus¬ 
sions, and the anonymous referee for insightful comments. 
BSG was supported by a Menzel Fellowship from the Har¬ 
vard College Observatory. AOP was supported by NSF 
grants DMS-0302812, AST-0434277, and AST-0433809, and 
an MIT Martin Futher King, Jr. Visiting Professorship in 
Physics. 


APPENDIX 

A. UNIVERSAF REFATIONS FOR FOFDS 

The generic pro perties of lensing nea r a fold caustic h ave been studied before bylBlandford & Naravanldl 986h . lSchneider et all 
ill 9921 Chapter 61. iPetters et al.l (1200 ll Chapter 9), and lGaiidi & Petter^ (l2002al) . In this appendix we extend the analysis to a 
higher order of approximation. 


A.l. Local orthogonal coordinates 

Consider the lens equation y = i'-grad^/>(i). If we assume that the induced lensing map, = x-sradMx), f rom the lens 
plane to the source plane is locally stable, then the caustics of fj must be either folds or cusps dPetters et alJl200il p. 294). Fet 
us focus on a fold caustic, and translate coordinates in the lens and source planes so that the caustic passes through the origin 
/= 0 of the light source plane, while the origin i'= 0 of the lens plane maps into the origin of the light source plane. By abuse of 
notation, we still use x and y to denote the translated coordinates. 

Consider a small neighborhood Nl about the origin in the lens plane, which maps to a local region Ns about the origin in the 
source plane. We assume Ns is sufficiently small that no critical points outside Nl are mapped into Ns, and there are no cusp 
caustic points inside Ns- In other words, the only caustic in Ns is a fold arc passing through the origin. 

By Taylor expanding, we see that the Jacobian matrix of the lensing map ff is giyen at the origin x = 0 by 


where 


[Jac^(0) = 


l-2a -b 
-b l-2c 


a=-ipu(0), 


b = tpi2(0), 


c= 2^22(0). 


(Al) 

(A2) 


The subscripts indicate partial deriyatiyes of t/j relatiye to x = (xi,X 2 ). Note that ijj has no linear part (since if maps the origin to 
itself). For y = 0 to be fold caustic point, the rank of [ Jac ry] (0) must be unity, which means that we must haye (1 - 2a)( 1 -2c) -b^ = Q 
while at least one of (1 —2a), (1 -2c), and b^ does not yanish ( IPetters et alJl200ll p. 349). Consequently, (1 - 2d) and (1 -2c) 
cannot both yanish. We lose no generality by assuming 1 - 2d ^ 0. 
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Now introduce the orthogonal matrix (see lPetters et al.l200lL p. 344) 


M = 


1 

\J{l-2a)^ + b^ 


\-2a -b 
b l-2d 


and define new orthogonal coordinates in the neighborhoods Nl and Ns by 

0 = (01,02) = Mi) M*= (mi,M 2 ) = My. 


(A3) 


(A4) 


(Note that the coord inate changes are the same in the lens and source pl anes, which is different from the approach of 
[Schneider et alJfT992l p. 185.) Using these coordinates, iPetters et alJ j2001[ p. 346) showed rigorously that x = 0 is a fold 
critical point if and only if the following conditions hold: 

(l-2d)(l-2c) = P, l-2df^0, £/ =-'0222(0)^0. (A5) 

Remark: The matrix M orthogonally diagonalizes [JacTyKO). 

Let us now Taylor expand the lens potential near the origin. We argue below that carrying the expansion to fourth order in 
0 is both necessar y and sufficient for the precision we desire. The most general fourth order expansion can be written as (see 
IPetters et al.l200ll pp. 346-347) 

ipiO) = — (1 — A')0j + — 0| + e0[ + f 9\92+g0\9\ + h9\ + k0‘\ + m9\92 + n9\9\+ p 9i9\ + r 9\. (A6) 

The zeroth order term in the potential is itTelevant, so we neglect it. The first order terms must vanish in order to ensure that the 
origin of the lens plane maps to the origin of the source plane. In the second order terms, the coefficients of the 0i02 and 9\ terms 
are fixed (to 0 and 1 /2, respectively) by the conditions that the origin is a fold critical point such that [Jac77](0) is in diagonal 
form. Note that the coefficient e of t he 0? term here is d ifferent from the ellipticity parameter used in the main text. We retain 
e here to match the notation used bv IPetters et al.l(l2001l) . The e coefficient does not appear in the main text, and the ellipticity 
parameter does not appear explicitly in this appendix, so there should be little confusion. 

Conventional analyses of lensing near a fold caustic have only considered the K, e, f, g , and h terms in the expansion 
( iBlandford & Narava nl ll986l: fS chneider et 'aD ll992t fPetters et a l.l l2001l: lO audi & Pettersll2002al) . However, we shall see below 
that some of the other terms are significant for our analysis. 


A.2. Image positions and magnifications 

We seek to use perturbation theory (e.g.. lBellmanll96^ to find expansions for the image positions and magnifications that are 
accurate to first order in u. We shall work from our fourth order expansion of the lens potential, and then verify that it is adequate 
for our purposes. For bookkeeping purposes, let us introduce scalar parameter ^ by taking u^^u, so we can identify terms of a 
given order by examini ng th e power of 
For the potential eq. (IA6> . the lens equation is 

(,ui=K9i - {3e9j + 2f9i92+g9l) - {4k9l + 3m9j92 + 2n9i9l + p9l), (A7) 

= - (/ 0? + 2g 0102 + 3/r 0|) - (m 0^ + 2n 9% + 3p 0i 0f + 4r 0^) . (A8) 


Since the lowest order terms are linear or quadratic in 0, it is natural to postulate that the image positions can be written as a 
series expansion in ^ with the following form: 

0i = aiC'/" + /3iC+O(O^/", (A9) 

92 = a2e^^ + M + O(0^^\ (AlO) 

Substituting into the lens equation, we obtain: 

0 = {aiK)^^^^-{3ale + 2aia2f+alg-f3iK+ui)^ + O(0^^^, (All) 

0 = -(Q:i/+2Q!iQ;2g + 3a2/r + M2)^- \2aYl3if + 2{ai(32 + a2l3i)g + 6a2l32h + a\m + 2ala2n + 3aialp + 4alr^ + O(^)\N\2) 

It is easily understood why these equations are carried to different orders. Eq. (ET) has a term that is linear in 0, which means 
that correction terms appear at 0(0^'^^. By contrast, in eq. (|A8} the lowest order term is quadratic in 0, and since 

9i9j = aiaji + {ail3j + ajl3i)i^l^ + O{0' (A13) 

we see that the correction terms only appear at O (0^. _ _ 

Following perturbation theory, we can now solve for the unknowns a,- and Pi by demanding that eqs. (lAl 1> and JA12> be 
satisfied at each order in 0 We then find that the positions of the two images can be written as 


0^ 


3hui—gU2 

3fi^ 




ci/2 3g/rMi-(g2 + 2/:r)M2 ^ , ^^,^,3/2 
3/r ^ 9h2K ^ 


(A14) 

(A15) 






























19 


Note that the distance between the two images is 

+ (A16) 

To find the magnifications of the images, we compute the Jacobian determinant of the lens equation, and evaluate that at 9^ to 
obtain 

(/r±)"‘ = ±2Ky'-3hu2 ^ {g^-3fh + 2Kr) « 2 ?+ 0(0^^" ■ (A17) 

This result shows that the ± labels for the two images have been assigned such that > 0 while /i“ < 0. 

Remarks: (1) To check our results, we note that at lowest order we recover the same scalings d\ oc and oc found by 
previous analyses (ISchneider et alJl992HPetters et alJ200lUGaudi & Pettersl2002^ . (2) To first order in ^ the image se parati on 
and the two magnifications depend only on the U 2 component of the source position. (3) In several places we have ^/-hu 2 or 
\J-u 2 lh. In general, at least for simple lens potentials like an isothermal ellipsoid or isothermal sphere with shear, we have < 0 
all along the caustic. This means that only source positions with U 2 > 0 lead to the production of two fold images. _ 

At first order in the image positions and magnifications, the presence of r demonstrates that the fourth order terms in eq. (|A 6 j 
cannot be ignored. At the same time, we can now verify that going to fourth order is sufficient. Any term of 0{9)^ in the potential 
would enter the lens equation at 0 ( 0 )"^; that would in turn be of order or higher, which is beyond the order to which we are 
worki ng. S imilarly, terms of 0{6)^ in the potential would enter at 0(9)^ or at least In other words, going to fourth order 
in eq. is both necessary and sufficient when we seek the image positions and magnifications correct to first order in 


A.3. Generic behavior of the fold relation 

From eq. JA17t we see that the two fold images have magnifications that are equal and opposite to lowest order in which 
means that the combination - |/j,“| should approximately vanish. In observed lenses, the magnifications are not directly 
observable but the fluxes are, so to construct a dimensionless combination of the fluxes we define 


^fold = ” 




Plugging in the series expansions for /i^, we find 




2(g^-3fh + 2Kr) 2 , 


3hK 


3h 


By comparing the expansion for di in eq. (EHi, we see that we can write 


(A18) 


(A19) 


J^fold = Afold t/l + O (0 , 


(A20) 


where 


A fold = 


g^-3fh + 2Kr 


3V^?22~3V'112V'222 + '02222(I-V’1i) 

6V'222(1 -V'1i) 


(A21) 


In the last equality, we have replaced the coefficients {f,g,h,K,r) with their definitions in terms of derivatives of the potential; 
for example. 


/= 2 ^n 2=2 

1 , 1 
^=2^122=2 


d9]d92 ’ 
d^ij) 

d9id9l ’ 


(A22) 

(A23) 


and so forth, where the derivatives are evaluated at the origin 6 * = 0. 

To summarize, /?foid vanishes for a source asymptotically close to a fold caustic. For a source a small but finite distance away, 
there is a correction term that scales as the square root of the distance of the source from the caustic, or (more usefully) as the 
separation between the two fold images. The coefficient Afoy of this linear scaling depends on properties of the lens pote ntial at 
the fold critical point. In particular, the presence of r reiterates the fact that the fourth order expansion in the potential (eq. lAa is 
necessary to obtain an expansion for /?foid that is accurate at order or d\. 

We have verified all of the a pproximations in this appendix by comparing them to exact numerical solutions of the lens equation 
obtained with the software bv lKeetonlJ2001h . 


B. DATA FOR THE OBSERVED EOUR-IMAGE LENSES 

In this appendix we summarize the observational data that we use for all of the observed four-image lenses; this text comple¬ 
ments the data values given in Table 1. Eor each lens system, we need the relative positions of the lensed images in order to 
measure the separations d\ and d 2 , and also to use as constraints on lens models for determining the Einstein radius Rgin- The po¬ 
sition of the lens galaxy (or galaxies), if available, is also valuable for the lens modeling. We also need the flux ratios between the 
images in order to determine Rma- We consider radio and optical/near-infrared flux ratios separately, because they are believed to 
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correspo nd to very different source sizes and provide very different information about small-scale structure in the lens potential 
(see, e.g.. lDalal & Kochanel<l200^lDohler & Keetonl2005t) . At optical/near-inf rared wavelengths, we examine the colors of the 
images to detect (and correct for) any differential extinction that may be present. iFalco et aP (119991) carried out a detailed version 
of this analysis for a sample of lenses that includes seven that we consider, and we use their results where available. If there is no 
evidence for differential extinction, we combine data from different passbands using a weighted average. At radio wavelengths, 
we again examine the wavelength dependence of the flux ratios to determine that there are no significant electromagnetic effects. 
If there are other flux measurements, such as in optical broad emission lines or at mid-infrared wavelengths, we use those as well 
(as discussed below). __ 

0047-2808: This is a quadruply image d system ijW arren et al.lll996l Il999t) . but its lack of pointlike images means that it 
requires special modeling techniques (e.g.. lWavth et alJ l2nn.5bll . and that it is probably not very interesting for the analysis of 
small-scale structure anyway. We do not include it in our sample. 

B0128-h437: For the image positions, we use the radio astrometry fro m iPhillips et all ( 20001). For the radio fluxes, we use 
the mean and scatter in Rfou from 41 epochs of MERLIN monitoring bv iKoonmans et al. (l2003a|l . The monitoring shows no 
evidence for time dependence. 

HE 0230-2130: We use the optical HST astrometry from CASTLES. There are two lens galaxies; we include both of them 
in lens models, taking their observed positions as constrai nts but optimiz i ng the ir masses. For the optical fluxes of the lensed 
images, we use the BRI data for images A, B, and C from lWisotzki^^aI] j[j_999h. The colors are consistent with no differential 
extinction. (Image D is not well separated from galaxy G2 in the lWisotzki^alJ data. so we do not consider it.) 

MG 041 4-H0534: We use the optica l HST astrometry from CASTLES. We include the satellite galaxy near the lens galaxy 
(“o bject X”ISchechter & Moorelll993ll in lens models. Eor the optical image fluxes, w e use the extinction -corrected flux ratios 
frorn iFalco et al.l(ll999l) . Eor the radio fluxes, we use the high-resolution VLBI data from iTrotter et alJ ( 120001) . Those observations 
resolve each image into four subcomponents; the Rfou value s are similar for the diffe rent subcomponents, so we take the weighted 
average. The radio flux ratios are constant in time to 1-3% (iMoore & Hewittll997h . 

H E 0435-1223: We use t he optical HST astrom etry from CASTLES. For the optical broad-band fluxes, we use the gri data 
from lWisotzki et akl (|2^3)- IWisotzki et alJ ( l2003l) also report emission line fluxes; we take the mean and scatter in Rfoid from 
the C IV and C III] lines. There is no evidence for wavelength dependence in the broad-band flux ratios, and the spectra of the 
different images have identical spectral slopes, so there does not appear to be any differential extinction. Image D appeared to 
vary by 0.07 mag between the two sets of observations, which may imply evidence for microlensing. 

B0712-h472: We use the optical HST astrometry and photometry from CASTLES. The values of Rfou differ slightly in the V, 
I, and H bands, but within the measurement uncertainties; hence there is no evidence for different ial extinction. Eor the radio 
fluxes, we use the mean and scatter in Rfoid from MERLIN monitoring bv iKoopmans et aP (12003 ah . There is evidence for time 
dependence in the radio fluxes. 

RX J0911-H0551: We use the optical HST astrometry and photometry from CASTLES. The lens galaxy has a satellite galaxy, 
which we include in lens models. The image flux ratios vary with wavelength in a m anner that is consisten t with differential 
extinction, so we cor rect for extinction us ing a redshifted Ry = 3A extinction curve from ICardelli et al1(ll989l also see Paper I). 

SDSS 0924-h0219: iKeeton et al .1 (1200.51) report image positions, broad-band flux ratios, and broad emission line flux ratios from 
HST observations. We use the weighted average of the V and I broad-band flux ratios. We use th e emission line flux es with 5% 
uncertainties, which is probably conservative. The best color information comes from gri data bv llnada et alJ ll2003ll . which are 
consistent with no differential extinction. __ 

SDSS 1004-h4112: This lens is produced by a cluster r ather than a single g alaxy (iQgiiri et'^ |2004l ), but we can stiU treat it 
with our formalism. We use ground-bas ed griz data fro in|Qguri et al.l (l2004i) . and HST/I data from llnada et al ] il2004h . There 
is no eyidence for differential extinction. iRicha^rds et al.l ( l2004l) claimed to obserye microlensing of the broad emission lines in 
image A, but the leyel of yariability in the continuum is not yet known. 

PG 1 115-H080: We use t he HST astrometry and photometry from CASTLES, and the mid-infrared flux ratios from IChiha et al.l 
d2005l) . iFalco et ^ (Il999l) find that the VIH data are c onsistent with no differential extinction. In the lens models, we explicitly 
include the group of galaxies surrounding the lens (see lKeeton & Kochanekll9 97HlmDey et alJl 998h . 

RX J1131-1231: We use the ground-based astrometry and nhotometrv from IShise et alJil2003h . They report two epochs of V 
data and one epoch of R. The colors are consistent with no differential extinction. The total flux varied between the two epochs, 
but the flux ratios remained constant. _ _ 

HST 12531-2914: We use the HST astrometry and photometry from iRatnatiinga et al.l ill99.5h and CASTl .ES. IFalco et al.l 
d 19991) find that the V-I colors are consistent with no extinction (within th e noise). 

HST 14113-1-5211: We use the HST astrometry and photometry from iFischer et akl dl998l) and CASTLES. There is some 
scatter among the values of Rfoid obtained from V-, R-, and I-band data, but the scatter is within the (fairly large) measurement 
uncertainties. 

H 1413-I-117: We use the HST astrometry from CASTLES. Eor the optical fluxes, we use the extinction-corrected flux ratios 
from iFalco et al.ldl999h . 

HST 14176-1-5226: We use the HST astrometry and photometry from lRatnatunga et al.ldl99-5h . IFalco et alJdl999t) find that the 
colors are consistent with no differential extinctiom _ 

B1422-I-231: We use th e radio data from iPatnaik et al.1 dl999h . The radio fluxes are basi cally constant in time 
dPatnaik & Narasimhall200l1) . For the optical fluxes, we use the extinction-corrected flux ratios from IFalco et al HHH). We 
also use the mid-infrared flux ratios betwee n images A, B, and C (image D was not detected) from IChiba_g l_alJj[20f)^r 

B1555-I-375: We use the radio data from iMarlow et all dl999h . The data from radio monitoring bv iKoonmans et al.l d2n03ah 
yield similar results, but have larger formal errors. 
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B1608+656: iFassnacht et al.l J2002h monitored the radio fluxes, measured the time delays, and determined the delay-corrected 
magniflcation ratios; we ta ke the mean and scatter in Rfou from their three seasons of data. There are two lens galaxies; we model 
the system using data from lKoonmans et allfcOOShh . 

B1933- H503: There are te n lensed images associated with three different sources. We use all of the images in lens modeling, 
following ICohn et'^il2flfllll . However, for the fold analysis we use only the fold quad consi sting of images 1/3/4/6. For the radio 
fluxes, we first take the mean and scatter from 8.4 GHz monitoring bylRiggs et al.l (l2000l) . and then combine that in weighted 
average with measurements at other wavelengths bylS^kes_et^yj|1998L, 

WFI 2026-4536: We use the optical data from lMorguTetal.1 (l2004l) . We use all available data in which the images are 
resolved; ugriHKs plus HST/F160W for images B and C; and iHKs plus HST/F160W for images Ai and A 2 . There is some 
wavelength dependence that may suggest differential extinction or microlensing, the current data are inconclusive. We simply 
take the mean and scatter in from all of the data. _ 

WFI 2033-4723: We use the optical data from lMorgan et al.1(l2004ll . For the optical broad-b and flux ratios , we use all available 
data in which the images are resolved; ugri for images B and C; and ri for images Ai and Ai. lMorgan et al.l also report emission 
line flux ratios; we take the weighted average of Rfnu from the C IV, C III], and Mg II lines. 

B2045-h265: We use th e radio positions fronjFassnacht^ For the radio fluxes, we combine various measurements 

by IFassnacht et al] (119991) and monitoring bv lKooDman^eta^(l2003a^ ■ and take the mean and scatter in Rfoia- For the optical 
fluxes, we use HST data from CASTLES for images A, B, and C (image D was not detected). The VIH colors are consistent with 
no differential extinction. 

O2237-H0 305: We use HST as trometry from CASTLES. Eor the broad-band optical fluxes, we use the microlensing light 
curves from lWozniak et al.l(l2000l) . We correct for differ ential extinct ion using the redd ening deduced bv lFalco et al.lifl999ll . For 
the radio fluxes, we use the data from iFalco et alJ lll996ll . In addition. lAgol et all (l2000l) report mid-infrared flux ratios measured 
at 8.9 pm and 11.7 pm. 
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Table B 1. Lens Data 



Image Pair 

di 

Rfoid (optical) 

Rfoid (radio) 

Rfoid (other) 

References 

B0128+437 

AB 

0.14 


0.263 ±0.023 


Koonmans et al. f2003a) 

Re.„=0.20 

AD 

0.27 


0.328 ±0.028 



fold 

CD 

0.42 


0.014 ±0.042 




CB 

0.50 


-0.058 ±0.037 



HE0230-2130 

AB 

0.74 

0.000 ± 0.008 



Wisotzki et al. 119991 

i?e.„=0.82 

CD 

1.46 





fold 

AD 

1.64 






CB 

1.65 

-0.289 ± 0.007 




MG0414+0534 

A 1 A 2 

0.41 

-0.024 ±0.038 

0.085 ±0.002 


Ealco et al. ^9991. 

«em=108 

BA 2 

1.71 

-0.500 ±0.043 

-0.477 ± 0.004 


Trotter et al. ^fJ^Ol 

fold 

A|C 

1.96 

0.739 ±0.015 

0.736 ±0.003 




BC 

2.13 

0.400 ±0.046 

0.323 ±0.007 



HE0435-1223 

CB 

1.53 

-0.029 ±0.014 


-0.035 ±0.010 

Wisotzki et al. 120021. 

=1.18 

AB 

1.59 

0.226 ±0.004 


0.136 ±0.005 

Wisotzki et al. [2003) 

cross 

CD 

1.85 

0.049 ±0.019 


0.137 ±0.007 



AD 

1.88 

0.299 ±0.012 


0.300 ±0.011 


B0712+472 

AB 

0.17 

0.519 ±0.052 

0.085 ±0.036 


Kooomans et al. (2003a). 

Rein =0.68 

CB 

0.91 

-0.123 ±0.075 

-0.337 ±0.051 


CASTLES 

fold/cusp 

CD 

1.18 

0.361 ±0.062 

0.672 ±0.120 




AD 

1.25 

0.636 ±0.062 

0.848 ±0.060 



RXJ0911+0551 

BA 

0.48 

0.027 ±0.013 



CASTLES 

^cin = 0.95 

BC 

0.62 

0.303 ±0.012 




cusp 

DC 

2.96 

-0.137±0.016 





DA 

3.08 

-0.400 ±0.014 




SDSS0924+0219 

AD 

0.69 

0.873 ±0.002 


0.821 ±0.012 

Keeton et al. [2005) 

Rein =0.87 

AC 

1.18 

0.483 ±0.003 


0.696 ±0.019 


fold 

BD 

1.46 

0.751 ±0.002 


0.593 ±0.023 



BC 

1.53 

0.149 ±0.002 


0.363 ±0.031 


SDSS1004+4112 

AB 

3.73 

0.194 ±0.015 



Osuri et al. (20041. 

Rein = 6.91 

DB 

11.44 

-0.512±0.017 



Inada et al. ^0041 

fold 

AC 

11.84 

0.401 ±0.011 





DC 

14.38 

-0.312±0.016 




PGl 115+080 

A 1 A 2 

0.48 

0.215±0.011 


0.036 ±0.032 

CASTLES, 

Rem =1.03 

AiB 

1.67 

0.722 ±0.009 


0.724 ±0.104 

Chiba et al. [2005) 

fold 

CB 

1.99 

0.214±0.019 


0.135±0.234 



CA 2 

2.16 

-0.445 ±0.011 


-0.632 ±0.060 


RXJl 131-1231 

BA 

1.19 

0.209 ±0.013 



Sluse et al. [2003) 

Rem =1.81 

CA 

1.26 

-0.272 ±0.019 




cusp 

BD 

3.14 

0.824±0.012 





CD 

3.18 

0.587 ±0.026 




HST12531-2914 

BC 

0.77 

-0.092 ± 0.057 



Ratnatunea et al. 119951. 

Rem = 0.55 

AC 

0.78 

-0.187 ±0.046 



CASTLES 

cross 

BD 

0.91 

0.164 ±0.089 





AD 

1.02 

0.015 ±0.078 




HST14113+5211 

CD 

1.13 

0.138 ±0.049 



Fischer et al. (1991^1. 

Rem =0.83 

CB 

1.38 

0.287 ±0.109 



CASTLES 

cross 

AD 

1.41 

0.128 ±0.049 





AB 

1.42 

0.305 ±0.116 




H1413+117 

AB 

0.76 

0.031 ±0.016 



Ealco et al. ^g991 

Rein =0.56 

AC 

0.87 

0.205 ±0.015 




cross 

DC 

0.91 

-0.056 ±0.023 





DB 

0.96 

-0.229 ± 0.022 




HST 14176+5226 

CB 

1.73 

0.088 ± 0.040 



Ratnatunua et al. [1995) 

Rein = 1.33 

AB 

2.09 

0.163 ±0.040 




cross 

CD 

2.13 

0.089 ±0.043 





AD 

2.13 

0.164 ±0.040 




B1422+231 

AB 

0.50 

-0.038 ±0.018 

-0.038 ±0.007 

0.031 ± 0.027 

Ealco et^^^9991. 

Rein =0.76 

CB 

0.82 

-0.317 ±0.020 

-0.339 ±0.006 

-0.245 ±0.055 

Pafnaik et al. [199f)). 

cusp 

AD 

1.25 

0.942 ±0.019 

0.936 ±0.006 


Chiha et al. 120051 


CD 

1.29 

0.898 ±0.032 

0.884 ±0.011 



B1555+375 

AB 

0.09 


0.274 ± 0.003 


Marlow et al. (19991. 

Rem =0.24 

CB 

0.35 


-0.084 ±0.004 


Kooomans et al. {'2003a) 

fold 

AD 

0.40 


0.858 ±0.006 




CD 

0.42 


0.725 ±0.010 



B1608+656 

AC 

0.87 


0.321 ±0.006 


Fassnacht et al. (2002) 

Rem =0.77 

BC 

1.51 


-0.016 ±0.002 



fold 

AD 

1.69 


0.706 ±0.004 




BD 

2.00 


0.486 ±0.004 



B1933+503 

4-3 

0.46 


0.656 ± 0.007 


Svkes et al. 119981. 

Rem =0.49 

4-6 

0.63 


0.637 ±0.007 


Bieas et al. 120001 

fold 

1-3 

0.90 


0.143 ±0.014 




1-6 

0.91 


0.111±0.013 



WEI2026-4536 

A 1 A 2 

0.33 

0.181 ±0.043 



Moruan et al. (2004) 

Rein = 0.65 

A|C 

0.83 

0.626±0.015 




fold 

BC 

1.19 

-0.431 ±0.014 





BA 2 

1.28 

0.096 ±0.011 
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Table Bl. Lens Data — Continued 


Image Pair d{ ^foid (optical) /?foid (radio) /?foid (other) References 


WH2033-4723 

Ran = 106 
fold 

A 1 A 2 

A|C 

BA 2 

BC 

0.72 

1.54 

2.01 

2.13 

0.219±0.010 

0.330 ±0.007 
-0.072 ±0.012 
0.077 ± 0.044 


0.174 ±0.099 

0.056 ±0.074 
-0.042 ±0.091 
-0.161 ±0.057 

^oraan et al- (20U4) 

B2045+265 

AB 

0.28 

0.255 ±0.017 

0.287 ± 0.020 


Fassnacht et al. (1999). 

«em = 1.13 

CB 

0.56 

0.153 ±0.023 

0.133 ±0.045 


CASTLES 

cusp 

AD 

1.91 


0.809 ±0.022 




CD 

1.93 


0.750 ±0.033 



Q2237+0305 

AD 

1.01 

(variable; 

0.130±0.145 

-0.008 ±0.068 

Wozniak et al. (2000). 

i?e,„=0.85 

BD 

1.18 

see text) 

0.172±0.139 

0.048 ±0.062 

Falco et"^ (19961. 

cross 

AC 

1.37 


0.289 ±0.170 

0.270 ±0.079 

Aaol et al. (20001 


BC 

1.40 


0.328 ±0.163 

0.319 ±0.072 



Note. — The lengths Rein (Column 1) and d\ (Column 3) are given in arcseconds. We do not explicitly quote d 2 , because it can be determined from the 
other di values. For example, in B0128+437 the value of d 2 for image pair AB would be the smaller of d\ (AD) and d[ (CB). CASTLES denotes the CfA/Arizona 
Space Telescope Lens Survey (see http://cfa-www.harvard.edu/castles). 














